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Abstract. We consider discrete-time projective semilinear control systems 
ft+l = A(ut) ■ ft, where the states ft are in projective space RP d ~ 1 , inputs ut 
are in a manifold U of arbitrary dimension, and A: ti — > GL(d, R) is a fixed 
diffcrcntiable map. 

An input sequence (uo, ■ ■ ■ , njv— l) is called universally regular if for any 
initial state fo e MP d_1 , the derivative of the time- TV state with respect to the 
inputs is onto. 

In this paper we deal with the universal regularity of constant input se- 
quences (no , . . . , «o ) ■ Our main result states that for generic such control 
systems, all constant inputs of sufficient length N are universally regular, ex- 
cept for a discrete set. More precisely, the conclusion holds for a C 2 -open and 
C^-dense set of maps A. We also show that the inputs on that discrete set are 
nearly universally regular; indeed there is a unique non-regular initial state, 
and its corank is 1. 

In order to establish the result, we study the spaces of bilinear control 
systems. We show that the codimension of the set of systems for which the 
zero input is not universally regular coincides with the dimension of the con- 
trol space. The proof is based on careful matrix analysis and some elemen- 
tary algebraic geometry. Then the main result follows by applying standard 
transversality theorems. 



1. Introduction 

1.1. Basic definitions and some questions. Consider discrete-time control sys- 
tems of the form: 

(1-1) Xt+i = F(xt,Ut), (t = 0,1,2,...) 

where F : X x U — * X is any map. We will always assume that the space X of states 
and the space U of controls are manifolds, and that the map F is continuously 
differentiable. 

A sequence (xq, . . . , xn', uq, . . . , u/v-i) satisfying (jl.ip is called a trajectory of 
length N; it is uniquely determined by the initial state xq and the input (uq, . . . , Un-i)- 
Let <pN denote the time-JV transition map, which gives the final state as a function 
of the initial state and the input: 

(1.2) xjy = (f> N (xa;u , ...,un-i)- 

We say that the system (jl.ip is accessible from xq in time N if the set <^jv({^o} x 
IA N ) of final states that can be reached from the initial state xq has nonempty 
interior. 
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The implicit function theorem gives a sufficient condition for accessibility. If 
the derivative of the map 4>n(xq) •) at input (uq, . . . , ujv-i) is an onto linear map3 
then we say that the trajectory determined by (xq; uq, . . . , un-i) is regular. So the 
existence of such a regular trajectory implies that the system is accessible from xq 
in time N. 

Let us call an input (ito, . . . , itjv-i) universally regular if for every xq e X, the 
trajectory determined by (xo; uq, • ■ • , Ujv-i) is regular; otherwise the input is called 
singular. 

This concept is central in the present paper; it was introduced by Sonta^ in |Soj 
in the context of continuous-time control systems. The discrete-time analogue was 
considered by Sontag and Wirth in |SWj . They showed that if the system (jl.ip is 
accessible from every initial condition x$ in uniform time N then universally regular 
inputs do exist, provided one assumes the map F to be analytic. In fact, under 
those hypotheses they showed that universally regular inputs are abundant: in the 
space of inputs of sufficiently large length, those that are not universally regular 
form a set of positive codimension. 

In this paper, we are interested in control systems (jl.ip where the next state 
Xt+i depends linearly on the previous state Xt (but non-lincarly on Ut, in general). 
This means that the state space is K d , where d > 2 and K is either R or C and 
that (ll.ip now takes the form: 

(1.3) Xt+x = A(u t ) ■ x t , where A: Mat dxd (K). 

Following [CKlj . we call this a semilinear control system. 

In the case that the map A above takes values in the set GL(d, K) of invertible 
matrices, we consider the corresponding projectivized control system: 

(1.4) 6+1 = A(«t) • 6, 

where the states 6 take value in the projective space KP d_1 = K^/K*. We call 
this a projective semilinear control system. The projectivized system is also a useful 
tool for the study of the original system (JTT3J) : see e.g. |Wil ICK2] . 

Universally regular inputs for projective semilinear control systems were first 
considered by Wirth in |Wij . Under his working hypotheses, the existence and 
abundance of such inputs is guaranteed by the aforementioned result of [SW ; then 
he uses universally regular inputs to obtain global controllability properties. 

The purpose of this paper is to establish results on the existence and abundance 
of universally regular inputs for projective semilinear control systems. Differently 
from |SW1 IWi] . we will not necessarily assume our systems to be analytic. Let us 
consider systems (|1.4[) with K = R and A: U —* GL(d, R) a map of some class C r , 
with r 5= 1. To compensate for less rigidity, we do not try to obtain results that 
work for all C r maps A, but only for generic ones, i.e., those maps in a residua^ 
subset, or, even better, in an open dense subset. 

To make things more precise, assume IA is a C 00 (real) manifold without bound- 
ary]^ We will always consider the space C r (jU, GL(d, R)) endowed with the strong 
C r topologjQ. 

^This condition is usually written in terms of the rank of a certain matrix and it is usually 

called the rank condition. 

2 Sontag calls these inputs universally nonsingular; we follow the terminology of |Wil ICK21 - 
■^Recall that a subset of a Baire space is called residual if it is a countable intersection of open 

dense subsets. 

^Moreover, all manifolds are assumed to be Hausdorff paracompact with a countable base of 
open sets, and of finite dimension. 

^See e.g. |Hi| . Note that in the case that U is compact, this coincides with the usual uniform 
C r topology. 
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Hence the first question we pose is this: 

Taking N sufficiently large, is it true that for C r -generic maps A, the 
set of universally regular inputs in U N is itself generic? 
It turns out that this question has a positive answer. Actually, we show in [BG3 
that for r great enough, for maps A in a C r open and dense set, all inputs in 
U N are universally regular, except for those in a stratified closed set of positive 
codimension. So another natural question is this: 

Fixed parameters d, dimli, N, and r, what is the minimum codimen- 
sion of the set of singular inputs in U N that can occur for C-generic 
maps A: U — » GL(d, R)? 
This question seems to be very difficult. However, we do have a sharp estimate if 
we restrict ourselves to the subset of U N formed by non-resonant inputs, namely 
those inputs (uo, . . . , ujv-i) such that Ui ^ Uj whenever i ¥= j. To investigate what 
happens for resonant inputs is a much tougher job. 

In this paper we consider the most resonant case. Define a constant input of 
length N as an element oiU N of the form (w , "o, • • • > uo)- We propose ourselves to 
study universal regularity of inputs of this form. A possible interpretation is this: 
Suppose the system is controlled by a "lever" that is very hard to move. Then we 
want to know what positions of the lever are universally regular. For those positions 
it is possible to perturb the state of the system in any desired direction with only 
small moves on the lever. 

1.2. The main result. Our main result says that generically the singular constant 
inputs form a very small set: 

Theorem 1.1. Given d 5= 2 and m ^ 1, there exists N > 1 with the following 
properties. Let IA be a smooth m- dimensional manifold without boundary. Then 
there exists a C 2 -open C 00 -dense subset O of C 2 (U, GL(d, R)) such that for every 
system (|1.4I) with A e O, all constant inputs of length N are universally regular, 
except for those in a zero- dimensional (i.e., discrete) set. 

By saying that a subset O of C 2 (U, GL(d, R)) is C^-dense, we mean that for all 
r 2, the intersection of O with C r (U, GL(d, R)) is dense in C r (U, GL(d, R)). 

It is remarkable that the generic dimension of the set of singular constant inputs 
(namely, 0) does not depend on the dimension m of the control space U, neither on 
the dimension d— 1 of the state space. A partial explanation for this phenomenon is 
the following: First, the obstruction to universal regularity of the input (u, u, . . . , u) 
is the combined degeneracy of the matrix A(u) and of the derivatives of A at m. If 
m is small then the image of the generic map A will avoid too degenerate matrices, 
which increases the chances of obtaining universal regularity. If m is large then 
more degenerate matrices A{u) will inevitably appear; however the large number 
of control parameters compensates, so universal control is still likely. 

The singular inputs that appear in Theorem ll.il are not only rare; we also show 
that they are "almost" universally regular: 

Theorem 1.2 (Addendum to Theorem O . The set O a C 2 {U, GL(d,R) in The- 
orem can be taken with the following additional properties: If A e O and a 
constant input (u, . . . , u) of length N is singular then: 

1. There is a single direction £o 6 RP d_1 such the the corresponding trajectory 
of system (jl.4l) is not regular. 

2. The derivative of the map </>jv(£o; ') a t input (u,...,u) has coran^ 1. 



'The corank of a linear map L: V — > W is the number dim W — dimL(V). 
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To sum up, for generic systems (jl.4[) . the universal regularity of constant inputs 
can fail only in the weakest possible way: there is at most one non-regular state, 
which can be moved in all directions but one. 

Remark 1.3. We actu ally have a very precise description of the singular inputs that 
appear in Theorem We show t hat these singular inputs can be unremovable by 

perturbations, and therefore Theorem ll.ll is optimal in the sense that there are C 2 -open 
(actually even C -open) sets of maps A for which the set of singular constant inputs is 
nonempty. Also, by C 1 -perturbing any A in those C 2 -open sets, one can obtain an infinite 
number of singular constant inputs. In parti cular, it is not possible to choose O to be 
C 1 -open in the statement of the Theorem ll.il . See Appendix IbI. 

Remark 1.4. The integer N is a function of d and m we did not try to estimate precisely. 
However, we know that it is at most d 2 (see Remark I l.Tf ). 

Remark 1.5. In t he c ase of complex matrices (i.e., K = C), we have a corresponding 
version of Theorem ll.il where the maps A are analytic; see Appendix [cl. 

1.3. Reduc tion to the study of the set of poor data. The bulk of the proof 
of Theorem ll.ll consists on the computation of the dimension of certain canonical 
sets, as we now explain. 

We fixA:W^ GL(d, K) and consider the projective semilinear system f| 1 .4[) . 
Recall that 1-jet of the map A at a point u e U consists of the first order Taylor 
approximation of A around u. By the chain rule, the universal regularity of an 
input (uo, Mi, ... , uat-i) depends only on the 1-jets of A at points uq, . . . , u^v-i- 

Let us discuss the case of constant inputs (uq, ■ ■ ■ , uq). If we take local coordi- 
nates such that uq = and replace the matrix map A: IA — > GL(d, K) by its linear 
approximation, system (ll.4p becomes: 

(1-5) 6+1= (^A+f^u^C^Zt, (i=0,l,2,...), 

where A = A(uq) and Ci, . . . , C m are the partial derivatives at 0. This is the 
projectivization of a bilinear control system (see [El]). For these systems, the zero 
input is a distinguished one and the focus of more attention. 

To study system (|1.5|) it is actually more convenient to consider normalized 
derivatives Bj = CjA" 1 , which intrinsically take values in the Lie algebra gl(<i, IK). 
Consider the matrix data A = (A, B\, ... , B m ). We will explain how the universal 
regularity of the zero input is expressed in linear algebraic terms. Recall that the 
adjoint operator of A acts on gl(d, IK) by the formula Ad^-B) = ABA^ 1 . Consider 
the linear subspace Ajy(A) of gl(d, K) spanned by the matrices 

Id and (Ad^)^^), (i = 0, . . . , n - 1, j = 1, . . . , m). 

(The identity matrix appears because of the projectivization.) Then: 

Proposition 1.6. The constant input (0, ... ,0) of length N is universally regular 
for system (jl.5[) if and only if the space Ajy(A) acts transitively on the set of 
nonzero vectors. 

If An (A) acts transitively on IK^ for some N, then the data A is called rich; 
otherwise it is called poor. 

Remark 1.7. The spaces Ajv(A) form a nested sequence, which thus stabilize after 
finitely many steps. It is actually easy to see that stabilization occurs at most at time 
N = d 2 . Therefore there are two possibilities: either the zero input of length d 2 is univer- 
sally regular, or the zero inputs of all lengths are singular^ 



In other words, if you're old enough and still poor then you'll never get rich. 
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Let V m denote the set of poor data|f| A major part of our work is to study 
these sets. We prove: 

Theorem 1.8. The set V m is closed and semialgebraic, and its codimension in 
GL(d,R) x (g[(d,R)) m is m. 

Theorem 1.9. The set Vm. is algebraic, and its (complex) codimension in GL(d, C): 
(g[(d,C)) m is m. 



So Theorems ll.8l and ll.9l say how frequent universal regularity of the zero input 
is in the space of projective bilinear control systems (11.51) 

1.4. Ov erview of the proofs. Theorem ll.ll follow rather directly from Theo- 



rem 



1.8l by applying standard results from transversality theory. More precisely, 



the fact that the set Vm is semialgebraic implies that it has a canonical strati- 
fication. Thi s permits us to apply Thorn's jet transversality theorem and obtain 



Theorem 1.1 



On the other hand, Theorem 1 1.8l follows from its complex version Theorem 
by simple abstract arguments. 

Thus everything is based on Theorem One part of the result is easily ob- 
tained: we give examples of small disks of codimension m formed by poor data, so 
concluding that the codimension of Vm is at most m. 

To prove the other inequality, one could try to exhibit an explicit codimension m 
set containing all poor data. For m = 1 this task is feasible (and we actually 
perform it, because with these conditions we can actually check universal regularity 
in concrete examples). However, for m = 2 already the task would be very laborious, 
and to expect to find a general solution seems unrealistic. 

Our actual approach to prove the lower bound on the codimension of V) n is 
indirect. Crudely speaking, after careful matrix computations, we find some sets in 
the complement of V m that are reasonably "large" (basically in terms of dimen- 
sion). Then, by using some abstract results of algebraic geometry, we are able to 
show that Vm ^ is "small" , thus proving the other half of Theorem 

Let us give more detail about this strategy. We decompose the set V m = V m 
into fibers: 

Vm= |J {A}xV m (A), Vm(A) C [ Q l(d,C)] m . 

AeGL(d,C) 

It is not very difficult to show that for generic A in GL(<i, C), the fiber V m (A) has 
precisely the wanted codimension m. However, for degenerate matrices A, the fiber 
Vm(A) may be much bigger. (For example, one can show that if A is an homothecy 
and m < 2d — 3 then V m (A) is the whole [gl(d, C)] m .) In order to show that 
codimPm 3s m, we need to make sure that those degenerate matrices with do not 
form a large set. More precisely, we show that: 

(1.6) Vfce {0,...,m}, codim {A e GL(d, C); codim7' m (A) < m - k) 3* k. 

Let us explain how we prove (jl.6|) . In order to estimate the dimension of V m (A) 
for any matrix A e GL(<i, C), we consider a quantity r = r(A) which is the least 
number such that a rich data of the form (A, C\, . . . , C r ) exists. In particular, if 
r = r(A) ;g m then the following affine space 

(1.7) {{C 1 ,C 2 ,...,C r ,B r+1 ,...,B m ) ] Bje Q l{d,C)} 
is contained in the complement of V m (A). 



8 A more precise notation would be P^fjj- However, we can think d as fixed; on the other hand 
it is sometimes useful to change m. 
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In certain situations, if two algebraic subsets have large enough dimensions then 
they necessarily intersect; for example, two algebraic curves in the complex pro- 
jective plane CP 2 always intersect. This kind of phenomenon happens here: the 
dimension of the affine space (|1.7|) forces a lower bound for the codimension of 
V m (A). More precisely, applying a result from |BG1] we obtain: 

(1.8) codimP m (A) m + 1 - r(A). 

So we need to show that matrices A with large r(A) are rare. A careful matrix 
analysis provides an upper bound to r(A) based on the numbers and sizes of the 
Jordan blocks of A, and on the occasional algebraic relations between the eigenval- 
ues. This b ound together with (|1.8p implies (|1.6[) and therefore concludes the proof 
of Theorem Q. 

In fact, the results of this analysis are even better, and we conclude that the 
codimension inequality (11.6[) is strict when k ^ 1. This implies that poor data 



(A, Si, ... , B m ) for which the matrix A is degenerate form a subset of Vm with 
strictly bigger codimension. Thus we can show tha t th e poor data that appear 



generically are well-behaved, which leads to Theorem 1 1.2 



1.5. Other remarks. One can also study uniform regularity of periodic inputs of 
higher period. Using our results for constant inputs, it is not difficult to derive 
some (non-sharp) dimension bounds for singular periodic inputs. However, for 
highly resonant non-periodic inputs, we have no idea on how to obtain reasonable 
dimension estimates. 

As mentioned above, in paper [BG3] we have dimension estimates for general 
inputs. These estimates are basically obtained by avoiding highly resonant inputs 
(which have large codimension themselves) . Thus the results of |BG3| are indepen- 
dent from those of these paper. The proofs there are less involved from the point of 
view of matrix computations, but use more sophisticated transversality theorems. 

Of course, it would be interesting to consider these kind of problems for other 
Lie groups of matrices, but we won't pursue this issue here. 

1.6. Organization of the paper. Section contains some basic results about 
transitivity of spaces of matrices a nd it s rel atio n with universal regularity. We also 



obtain the easy parts of Theorems ll.8l and ll.9l . namely (semi)algebraicity and the 
upper codimension inequalities. 

In Section [3] we introduce the concept of rigidity, which is related to t he q uantity 



r(A) mentioned above. We state the central rigidity estimates (Theorem l3.7f ). which 
consist into two parts. The first and easier part is proved in the same Section 0, 
while the whole Section is devoted to the proof of the second part. 

Section [H] starts with some preliminaries in el eme ntary algebraic geometry. Then 
we use t he ri gidity esti mates to prove Theorem ll.9l following the strategy outlined 



for the proof of Theorem |L2 



above (§ ll.4f ). Theorem 1 1 . 8l f ollows easily. We also obtain a lemma that is needed 



In Section @ we collect some basic facts about stratifications and transversa lity. 



and the n apply them together with the previous results to obtain Theorems ll.l 



and 1.2 



The paper also has three short appendices. 

Appendix [X| basically reobtains the major results in the special case m = 1, 
w here we actually gain additional information of practical value: as mentioned in 
it is possible to describe explicitly w hat 1-je ts th e map A should avoid in 



order to satisfy the conclusions of Theorems ll.ll and [L2J . The arguments necessary 
for the nri = 1 case are much simpler and more elementary than those in Sections |3j 
tod Therefore the appendix is also useful to give the reader some intuition about 
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the general problem, and as a source of examples. Appendix [X| is written in a 
slightly informal way , an d it can be read after Section H (though the final part 
requires Lemmas |3.ll and 13.21 ) . 

In Appendix |B| we take a clo ser look to the generic singular constant inputs, 
and in particular justify Remark ll .31 We also discuss the generic validity of some 
control-theoretic properties related to accessi bilit y and regularity. 

Finally, in Appendix [c] we apply Theorem ll .91 to prove a version of Theorem Ll 
for holomorphic mappings. 



2. Preliminary facts on the poor data 

In this section, we review som e ba sic prop erties related to poorness, and prove 
the easy inequalities in Theorems ll.a and 1 1 .9 . 



2.1. Transitive spaces. Let E and F be finite-dimensional vector spaces over the 
field K. Let C(E, F) be the space of linear maps from E to F. A vector subspace 
A of C(E, F) is called transitive if for every v e E \ {0}, we have A • v = F, where 
A-v = {L(v); L e A}. 

Under the identification C(K m , K") = Mat mx „(K), we may also speak of tran- 
sitive spaces of matrices. 

Example 2.1. Recall that a Toephtz matrix, resp. a Hankel matrix, is a matrix of the 
form 

(t h ■■■ td-i \ / hi hd-i h d \ 



t- 



ti 



resp. 



hd 



hd-i 



\ t- d+ i ■•• ti to / y h d hd+i ■ ■ ■ h 2d -i ) 

The set of Toeplitz matrices and the set of complex Hankel matrices constitute examples 
transitive subspaces of K). Transitivity of the Toeplitz space is a particular case of 
Example 1 2. 2l . and transitivity of Hankel space follows from Remark l2.3l . For K = C, these 
spaces are optimal, in the sense that they have the least possible dimension; see |Az| . 

Example 2.2. A generalized Toeplitz space is a subspace A of Matdx^K) (where d 5= 2) 
with the following property: For any two matrix entries (ii, ji) and (12,32) which are not in 
the same diagonal (i.e., ii— ji 12— 32), the linear map (6i,j)i,j e A 1— » (bi 1 j 1 , bi 2 j 2 ) e C 2 
is onto. Equivalently, a space is generalized Toeplitz if it can be defined by a number 
of linear relations between the matrix coefficients so that each relation involves only the 
entries on a same diagonal, and so that the relations do not force any matrix entry to be 
zero. We will prove later (see § l3.3h that every generalized Toeplitz space is transitive. 

Remark 2.3. If A is a transitive subspace of C(E, F) and P £ C(E, E), Q e C(F, F) are 
invertible operators then P • A • Q := {PLQ; L e A} is a transitive subspace of C(E, F). 

Let us see that transitivity is a semialgebraic or algebraic property, according to 
the field. Recall that: 

• A subset of K n is called algebraic if it is expressed by polynomial equations 
with coefficients in K. 

• A subset of K™ is called semialgebraic if it is expressed by finitely many 
polynomial equations or inequalities with coefficients in M. 

Proposition 2.4. Let jVj® n>k be the set of (B u . . . , B k ) e [Mat mxn (K)] fe = K mnk 
such that span{£?i, . . . , B^} is not transitive. Then: 



1. The set Ml 



is semialgebraic. 



2. The set M^ n k is algebraic. 
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Proof. Consider the set of {B\, . . . , B k , v) e [Mat mxn (K)] fe x K™ such that 

spaxi{B 1 ,...,B k } -v ^K m . 

This is an algebraic set, because it is expressed by the vanishing of certain determi- 
nants. Taking IK = R and projecting this set along the R* fiber we obtain J\f^ n fc ; 
so, by the Tarski-Seidenberg theorem (see |BCR[ p. 26]), this set is semialgebraic, 
proving part [TJ 

To see part[2J we take K = C and projectivize the fiber, obtaining an algebraic 
subset [Mat mx „(C)] fc x CP™" 1 whose projection along the CP" -1 fiber is M { ^ n k . 
So part follows from the fact that projections along pro jecti ve fibers are closed 
maps with respect to the Zariski topology (see Proposition l5.ll below). □ 

Another important fact is that complex transitivity of real matrices is a stronger 
property than real transitivity: 

Proposition 2.5. The realpart ofAff£p n k (that is, its intersection with [Mat mx „(R)] fe y ) 
contains N { ^ n k . 

Moreover, the inclusion can be strict. The explanation is this: real matrix data 
can be R-transitive without being C-transitive because the directions that detect 
non-transitivity are non-real. A formal proof and examples are provided in [BG2 . 

Remark 2.6. The codimension of N^p k is computed in |BG2| : it is max(fc-m-n+2, 0). 

We also observe in [BG2] that , can fail to be real-algebraic. But we will not need 

those results in the present paper. 

2.2. Universal regularit y fo r constant inputs and richness. In this subsec- 
tion we prove Proposition ll .6t in fact we prove a more precise result, and also fix 
some notation. 

Recall that if A e GL(d, K) then the adjoint of A is the linear operator Ad^ on 
Ql(d, K) given by the formula Ad^(S) = ABA^ 1 . 

If A : IA — > GL(d, C) is a differentiable map then the normalized derivative of A 
at a point u is the linear map T U U — > gl(d, R) given by h >— » (DA(u) ■ h) o A _1 (u). 

Let 4>n(£,o, u) be the state £jv e KP d of the system (|I.4I) determined by the initial 
state £o and the input sequence u e U N . Let d2<pN(^o,u) be the derivative of the 
map 4>n(£,o, •) at u. 

Fix a constant input u = (u, . . . , u) e IA N : and local coordinates on U around u. 
Let Bj be the normalized partial derivatives of the map A at u with respect to the 
i th coordinate. Consider the data A = (A, B\, . . . , B m ), where A = A{u). Define 
the following subspace of gl(d, K): 

(2.1) Ajv(A)=K-Id+ span {Ad^(Bj)}, 

0sSti<W-1 

Proposition 2.7. For all £o e KP^" 1 and any Xq e ~K d \ {0} representing £o, 
rank<9 2 </>jv(£o,") = dim [ A w(A) • (A N x )] - 1. 

In particular (since A = A(u) is invertible), the input u is universally regul ar if 
and only if A^v(A) is a transitive space, which is the statement of Proposition 

Proof of Proposition \2A . Let £o = [xq] , where x Q e K d \ {0}. Let iPn{xq,u) be 
the final state of the non projectivized system fj 1 . 3[) determined by the initial state 



'This proof also appears in IBG2I 
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Xq and by the sequence of controls u e U N . Using local coordinates with u in the 
origin, we have the following first order approximation for u ^ 0: 

^ N (x ,u)^A N x + J] u tJ A N - t - 1 B 3 A t+1 x 

Id+ ^] ujv_i_ n jAd^(Bj) \x N , 
where x^r = iPn(xo, 0) = ^.^xo- Therefore the image of d2fpN(xo, u) is the following 



subspace of T A ■ ■ v: ' ' • 



V = span Ad^-Bj • xjv, 



The image of <5 2 <M£o,w) equals Dtt(x n )(V), where tt : K d \ {0} KP^ 1 is 
the canonical projection. Notice that Ker Dtt(x) = Kx for any x e K d \ {0}. It 
follows that 

rank 320v (^o j w) = dim [D-k(xn)(V)] 

= dim [Dtt(x n )(Kx n + V)] = dimpfLcjv + V] - 1 

Since Kccjy + V = Atv(A) • xjv, the proposition is proved. □ 

The discussion above motivates the introduction of a more general notation, 
which will be convenient later. Consider a linear operator H: E — > E, where E is 
a finite-dimensional vector space over the field K. Given a vector we E, the orbit 
of v under H is the set {H n v; n 5= 0}. Denote the space spanned by the orbit by 
sorb// v. We have 

sorb// v = {f(H) ■ v; f is a polynomial with coefficients in K} . 

It follows from the Cayley-Hamilton theorem that sorb// v is the space spanned by 
the first dimi? iterates of v. 

sorb// v = span{if"u; n = 0, . . . , dimE — 1}. 

Let us also denote 

sorbj/(ui, . . . , v n ) = sorb// V\ + ■ ■ ■ + sorb// v n . 

In this notation, the union A(A) := IJ^ Ajy(A) of the elements of the sequence 
(|2.ip is expressed as 

(2.2) A(A) = sorbAcU (Id, B m ), where A = (A, B x , . . . , B m ). 

We have Ajv(A) = A(A) for all N > d 2 , as stated in Remark [L~7l. 

2.3. The sets of poor data. For emphasis, we repeat the definition already gave 
at the introduction: The data A = (A,Bi,..., B m ) e GL(d, K) x [$l(d, K)] rn is rich 
if the space A(A) defined by f|2.2[) is transitive, and poor otherwise. The concept in 
fact depends on the field under consideration. The set of such poor data is denoted 



It follows immediately from Proposition \2.4 that V d is a closed and semialge- 



braic subset of GL(d, R) x [gl(d, R)] m and P^, is an algebraic subset of GL(d, C) 



[gl(d,C)] m . This proves part of Theorems Of and Q. 



Also, by Proposition 12.51 the real poor data are contained in the real part of the 
complex poor data, i.e., 

(2.3) n [GL(d, K) x [ Q l(d, K)] m ] <= . 
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Let us also note that the sets of poor data are saturated in the sense of the follow- 
ing definition: A set Z c [Matdxd(IK)] 1+m will be called saturated if (A, Bi, . . . , B m ) e 
Z implies that: 

• For all P e GL(d, K), the tuple (P^AP, P~ X B X P, P~ l B m P) belongs to 
Z. 

• For all Q = (<? y ) 6 GL(m,K), the tuple (A, B[, . . , , B' m ), where B[ = 
Y,j QijBj, belongs to Z. 

Remark 2.8. 1. A subset [Matdxd(]K)] 1+m is saturated if and only if it is invariant 
under a certain action of the group GL(d, K) x GL(m, K). 
2. The real part of a complex saturated set is saturated (in the real sense). 



2.4. The easy codimension inequality of Theorems ll.8l and I1.9L Here we 
will discuss the simplest examples of poor data. 

To begin, notice that if A e GL(d, C) is diagonalizable then so is Ad^- Indeed, 
assume without loss of generality that A = Diag(Ai, . . . , A^). Consider the basis 
{Eij; i,j e {1, . . . ,d}} of fll(d,C), where 

(2.4) Ei_j is the matrix whose only nonzero entry is a 1 in the (i,j) position. 
Then Ad A {E itj ) = A, A E,,. We summarize this fact as: 

/ 1_ AiA^ 1 • 

(2.5) Ad A = Diag A 2 A i 1 1 



So if / is a polynomial and B = ipij) then 

(2.6) the (i, j)-entry of the matrix (/(Ad^))(B) is /(AjAj 1 ^. 

The data A = (A, Bi, . . . , B m ) e GL(d, K) x g[(d, K) m is called conspicuously 
poor if there exists a change of bases P e GL(d, K) such that: 

• the matrix P~ X AP is diagonal; 

• the matrices P^B^P have a zero entry in a common off-diagonal position; 
more precisely, there are indices io, jo 6 {1, ■ • ■ , d} with irj ^ jo such that for 
each k £ {1, . .. ,m}, the (iq, jo) entry of the matrix P~ X B^P vanishes. 

(As in the definition of poorness, the concept depends on the field K.) 
Lemma 2.9. Conspicuously poor data are poor. 

Proof. Let A = (A, B\,..., B m ) be conspicuously poor. With a change of basis we 
can assume that A is diagonal. Let (ei, . . . , e^) be the canonical basis of K d . Let 
(i,j) be the entry position where all Bi's have a zero entry. By (|2.6[) . all matrices 
in the space A(A) given by (|2.2[) have a zero entry in the (io,jo) position. In 
particular, there is no L e A(A) such that L ■ ej = e !o , showing that this space is 
not transitive. □ 

The converse of this l emm a is certainly false. (Many exa mples appear in Appen- 
dix 0; see also Example I3.6I .) However, we will see in § 2J3 that the converse holds 
for generic A. 



W e w ill u se L emma I2.9I to prove the easy codimension inequalities for Theo- 
rems L8 and ll.9t first we need to recall the followineF^: 



1 ^Proposition [2, id follows from the implicit function theorem; for a proof using complex anal- 
ysis, see |Kal p. 67]. 
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Proposition 2.10. Suppose A e Mat^xdC^) is diagonalizable over K and with 
simple eigenvalues only. Then there is a neighborhood of A where the eigenvalues 
vary smoothly, and where the eigenvectors can be chosen to vary smoothly. 



Proposition 2.11 (Easy half of Theorems ll .81 and[L9|). For both K = K or C, we 
have codiniK Vm^ < m. 

Proof. Using Proposition |2. ldL we can exhibit smoothly embedded disks of codi- 
mension m inside GL(d, K) x gl(d, K) m formed by conspicuously poor data. □ 

2.5. Unconstrain ed matrices. The material from this s ubse ction is used in the 
proof of Theorem L2 , but not in the proof of Theorem ll.ll It is also used in 



Appendix 0- 

If p is an irreducible factor of the polynomial A^A^ — \j\k then the relation p = 
is called an elementary constraint in the variables Ai, . . . , A^. Every elementary 
constraint can be written, after a permutation of the indices 1, . . . , d, as one of the 
following: 

• a type 1 constraint: A1A3 = \\. 

• a type 2 constraint: A1A4 = A2A3. 

• a type 3 constraint: Ai = — A2. 

• a type 4 constraint: Ai = A2. 

We say that a matrix A e GL(d, R) is unconstrained if its eigenvalues, counted 
with multiplicity, do not satisfy any elementary constraint. 

Remark 2.12. A matrix A is unconstrained if and only if Ad^ has the maximal possible 
number of distinct eigenvalues, namely, d 2 — d+ 1. This is obvious from (|2.5[) if one restricts 
to diagonaliza ble m atrices A. The general case follows from the fact (which we will prove 
rigorously in § l4.3l ) that the multiplicities of the eigenvalues of AdA are those "predicted" 
by formula H2.5[) . 



Let us see that the converse of Lemma 1 2.91 holds for unconstrained A: 

Lemma 2.13. Suppose that the data A = (A, B\, ... , B m ) e GL(d, K) x gl(d, K) m 
is poor and that the matrix A is unconstrained. Then A is conspicuously poor. 

Proof. Suppose A is unconstrained. In particular, A has simple spectrum. With a 
change of basis we can assume that A is diagonal. 

Now suppose that A = (A, B±, ... , B m ) is not conspicuously poor. This means 
that for each off-diagonal position there is at least of of the matrices Bk that has 
a non-zero entry in that position. (Notice that this fact does not depend on the 
change of basis chosen before.) 

Since A is unconstrained, the values A^A^ 1 , where runs on the matrix 

positions outside the diagonal, are pairwise different, and all different from 1. Recall 
that one can always (using Lagrange formula) find a polynomial whose values at 
finitely many different points are prescribed. So It follows from (|2.6|) that the 
space A(A) contains all matrices (y^) such that yn = ■ ■ ■ = ydd, and in particular, 
all Toeplitz matrices. So A(A) is transitive, i.e., A is not poor. This proves the 
lemma. □ 



Let us establish another simple result, which is related to Theorem 1 1.2| . Denote 
by (ei, . . . , e<j) the canonical basis of C d . 

Lemma 2.14. Suppose that the data A = (A,B±, . . . ,B m ) e GL(d, C) x gl(d, C) m 

has the following properties: 

1. A is an unconstrained diagonal matrix; 

2. there are indices io, jo e {1, with io ¥= jo such that for each k e 
{!,... ,to}, the (iojj'o) entry of the matrix Bk vanishes; 
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3. the off-diagonal vanishing entry position (io,jo) above is unique. 
Then: 

1. There is a single direction [v] e CP d_1 such that A(A)-v ^ C d , namely [e JO ]. 

2. TTie space A(A) • e JO /ias codimension 1; in /act, i£ equals spanje^; i / i^}. 

Proof. Under the assumptions on A, the space A(A) contains 
{(Vij) e Ql(d, C); y 11 = • • • y loio = 0}. 
The conclusions follow easily. □ 

After the p relim inaries above, the optional Appendix [X| can be read (as we 
mentioned in § ll.6h . 



3. Rigidity 



The aim of this section is to state Theorem 13.71 and prove its first part. Along 
the way we will establish several lemmas which will be reused in the proof of the 
second part of the theorem in Section Q. 



3.1. Acyclicity. Consider a linear operator H : E —* E, where E is a finite- 
dimensional complex vector space. 

The operator H is called cyclic if it has a cyclic vector, that is, some v e E such 
that sorbff v is the whole space E. The following two lemmas are useful to find 
cyclic vectors, when they exist: 



Lemma 3.1. Suppose that E 



and that H is a Jordan block: 
( A 1 \ 



H 



\ 



1 

A/ 



Then a vector v = (x\, . . . , xi) is cyclic for H if and only if xg ^ 0. 
Proof. For any polynomial / we have (see [Gal page 100]): 



f(H) 



/(A) 



V 



/'(A) /"(A) \ 

1! 2! ' (i-iy. 



2! 

/'(A) 
1! 

/(A) J 



So the space spanned by the powers of H is the space of upper triangular Tocplitz 
matrices. The rest of the proof is an easy exercise. □ 

Lemma 3.2. Let E be a finite- dimensional complex vector space and let H : E — > E 
be a linear operator. Assume that E\, . . . , E% ez E are H -invariant subspaces and 
that the spectra of A\Ei (I i < k) are pairwise disjoint. If v% e E\, . . . , e 
then 

sorb ff (wi, . . .,v k ) = sorbff(ui H h v k ) . 
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Proof. The z> part is trivial; let us show the cz part. Take w e sorbjr (yi, . . . , Ufc), so 
w = 2 fi(H) ■ Vi, where each fi is a polynomial. Let pi be the minimal polynomial 
of H\Ei, and let qt = Yij^iPj- Since the spectra of A\Ei are pairwise disjoint, 
the polynomials pi are pairwise relatively prime, and so the polynomials qi are 
jointly relatively prime. Since polynomials form a principal ideal domain, there 
exist polynomials gi such that XI Sift = 1- Using that qi{H) • Vj = if i 7^ j, we 
have: 

i » \ 3 J 

= 2 fi{H) gi {H)<li{H) ■ Vi = [V fi(H) gi (H)n(H)) • £>. 

i V * } 3 

That is, to = f(H) ■ . u j f° r some polynomial /, as we wanted to show. □ 

We define the acyclicity of H as the least number n of vectors v±, . . . , v n e E 
such that sorb#(ui, . . . ,v n ) = E. We denote n = acyci/. So acycif = 1 means 
that if is a cyclic operator. 

Let us relate acyclicity with the Jordan normal form of H . The geometric mul- 
tiplicity of an eigenvalue A of if is the number of corresponding Jordan blocks or, 
equivalently, the dimension of the kernel of H — Aid. The following fact is probably 
well-known, but since we could not find a precise reference we provide a proofE] 

Proposition 3.3. The acyclicity of an operator equals the maximum of the geo- 
metric multiplicities of its eigenvalues. 

Proof. Let Ai, . . . , A& be the eigenvalues of H, counted without multiplicity, and 
E = E\ ® ■ ■ ■ ® Ek be the splitting into generalized eigenspaces. Let nt be the 
geometric multip licit y of Ai, and let n = maxn^. 



Using Lemma l3.ll . we find 1^1, . . . , v^ ni e Ei such that sorb# (vi ; i, . . . , Vi^ ni ) 



Ei. De fine Vjj = for m < j < n. Consider w = 2i=i v i,j> f° r 3 = -L • • • i n - By 



Lemma 13. 2 . sorb# uij = sorbp (vij, . . . , Vkj)- So 

sorbs (101, . . . ,w n ) = ^sorb^iq,.,, . . . ,v k ,j) = ^ j sorb H (v iA , . . . ,v i>n ) = E. 

3 i 

This shows that acyc H ^ n. 

To show the reverse inequality, assume that n = n\, for example. For each vector 
in E, write its coordinates with respect to the Jordan basis, and the consider only 
the coordinates corresponding to the rightmost columns of the Jordan blocks for 
Ai. This defines a linear map P: E — * C" such that PH = X±P. Now take any 
vectors U\, . . . , it n -i e E. Then the space S = sorbs (tii, . . . , u n -i) is sent by P to 
the vector space span{Pui, . . . , Pu n -i}, which has dimension ^ n. Since P is onto 
C", the space S cannot be the whole E. This shows that acyc if ^ n, completing 
the proof. □ 

Remark 3.4. The operators which interest us most are H = AdA, where A e GL(d, C). 
It is useful to observe that the geometric multiplicity of 1 as an eigenvalue of AdA equals 
the the codimension of the conjugacy class of A inside GL(d, C). To prove this, consider 
the map *a : GL(d, C) -» GL(d, C) given by ^a(X) = Adx(^4). The derivative at X = Id 
is H 1 * HA- AH; so Ker D\Ju(Id) = Ker(AdA - id). Therefore when X = Id, the rank of 
D^a(X) equals the geometric multiplicity of 1 as an eigenvalue of AdA- To see that this is 



The usual textbook approach is the other way around: one uses results about cyclic operators 
to obtain the Jordan normal form; see e.g. IGal . 
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true for any X, notice that = ^Ad x CA) oRx- 1 (where R denotes a right-multiplication 

diffeomorphism of GL(d, C)). 

We will see later (Lemma l4.15r) th at 1 is the eigenvalue of AdA with the biggest geomet- 
ric multiplicity. By Proposition 13.31 . we conclude that acyc AdA equals the codimension 
of the conjugacy class of A. 

3.2. Definition of rigidity, and the main rigidity estimate. Let E and F be 

finite-dimensional complex vector spaces. Let H be a linear operator action on the 
space £(E, F). We define the rigidity of H, denoted rig if, as the least n such that 
there exist L\, . . . , L n e C(E, F) so that sorbjj(£i, ■ ■ • , L n ) is transitive. Therefore 

1 < rig H acyc H . 

For technical reasons, we also define a modified rigidity of H , denoted rig + H . 
The definition is the same, with the difference that if E = F then L\ is required to 
be the identity map in C(E, E). Of course, 

rig H rig + H < rig H + 1. 



We want to give a reasonably good estimate of the mod ified rigidity of Ad,4 for 



any fixed A e GL(d, C). (This will be achieved in Lemma |4. 18l ) We assume that 
d 5= 2; so rig + Ad,4 > 2. The next example shows that "most" matrices A have the 
lowest possible rig + Ad^. 

Example 3.5. If A e GL(d, C) is unconstrained (see § |2.5| ) then rig + AdA = 2. Indeed 
if we take a matrix B e C) whose expr ession in the base that diagonalizes A has no 
zeros off the diagonal then, by Lemma 1 2 . 1 3l . A(A, B) = sort>Ad A (Id, B) is rich. 

More generall y, if A e GL(d, C) is little constrained (see Appendix 0) then it follows 
from Proposition I A. 21 that rig + AdA = 2. 

Example 3.6. Consider A = Diag(l, a, a 2 ) where a = e 2 '"'' 3 . fin the terminology of § l2~5l . 
A has constraints of type 1.) Since Ad A is the identity, we have dim sorb Ad a (Id . B) ^ 4 
for any B e £)1(3, C). By the result of Azoff Az] already mentioned at Example |2.ll . the 
minimum dimension of a transitive subspace of g[(3, C) is 5 . This shows that rig + AdA ^ 3. 
(Actually, equality holds, as we will see in Example 13.101 below.) 



Let T be the set of roots of unity. Define an equivalence relation — on the set 
C* of nonzero complex numbers by: 

(3.1) \~\'^ A/A' e T. 

We also say that A, A' are equivalent mod T. 
For A e GL(d, C), we denote 

(3.2) c(A) := number of different classes mod T of the eigenvalues of A. 

We now stat e a t echnical result which has a central role in our proofs, as explained 
informally in 

Theorem 3.7. Letd^2 and A e GL(d,C). Then: 

1. If c{A) = d then rig + Ad^ = 2. 

2. If c(A) < d then rig + Ad A acyc Ad^ - c(A) + 1. 

Remark 3.8. When c(A) = d, we have acyc AdA = d (this will follow from Lemma l-Lis! ): 
so the conclusion of part [5] does not hold in this case. 

Remark 3.9. The conditions of A being unconstrained and A having c(A) = d both m ean 
that A in "non-degener ate" . Both of them imply small rigidity, according to Example 13.51 
and part [T] of Theorem |3.7| . It is important, however, not to confuse the two properties; 
in fact, none implies the other. 
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Example 3.10. Consider again A as in Example |3.q. The eigenvalues of AdA are 1 , a, 
and a 2 , each with multiplicity 3; so Proposition 13.31 gives acyc AdA = 3. So Theorem 13.71 
tell us that rig + AdA < 3, which is actually sharp. 



The pro of of part[T]of Theorem [3/7J will be given in S l3.5l after a few preliminaries 
f^ l3.3l and l3.4 ). These preliminaries are also used in the proof of the harder part[^l 
which will be given in Section 

3.3. A criterion for transitivity. We will show the transitivity of certain spaces 
of matrices that remotely resemble Toeplitz matrices. 

Let t, s be positive integers. Let TZi be a partition of the interval [l,i] = 
{l,...,t} into intervals, and let H2 be a partition of [l,s] into intervals. Let TZ 
be the product partition. We will be interested in matrices of the following special 
form: 

/ 



(3.3) 



M 



(rrii j) is;. «t 



V 









\ 


* 













Mr 











* 










/ 



where R is an element of the product partition TZ, and Mr is the submatrix (m, ,j)uj)eR- 

Let A be a vector space of t x s matrices. For each R e TZ, say of size k x I, we 
define the following space of matrices: 
A [K] = ( N g Mat 



(3.4) 



; 3 M e A of the form {33} with Mr = N}. 



We regard A as a subspace of £(C* , C s ). If the rectangle R is [p,p + k] x [q, q + £], 
we regard the space AM as a subspace of 

^({O}^ 1 x C fe x {0}*-^, {0}"- 1 xC'x {0} s - q - e ). 
Lemma 3.11. Assume that AW is transitive for each Re TZ. Then A is transitive. 

An interesting feature of the lemma which will be useful later is that it can be 
applied recursively. Before giving the proof of the lemma, we illustrate its usefulness 
by showing the transitivity of generalized Toeplitz spaces: 



Proof of Examvle \2.a . Consider the partition of [l,d] 2 into lxl "rectangles". If 
A is a generalized Toeplitz space then A_W = Matixi(C) = C for each rectangle R. 
These are transitive spaces, so Lemma 13. Ill implies that A is transitive. □ 



Before proving Lemma I3.11L notice the following dual characterization of tran- 
sitivity, whose proof is immediate: 

Lemma 3.12. A subspace A ez £(C',C S ) is transitive iff for any non-zero vector 
u e C and any non-zero linear functional <f> e (C s )* there exists Me A such that 
<j>(M ■ u) * 0. 



Proof of Lemma lS.li . Take any non-zero vector u = (ux, . . . , u t ) in C and a non- 
zero functional <f>{v\, . . . ,v s ) = 2j=i ( t ) 3 v j m (C s )*. By Lemma 13.12 , we need to 
show that there exists M = (i% ) e A such that 

t s 

(3.5) </>(M • u) 



\ 2 (f>jXijUi 



is non-zero. 
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Let io be the least index such that Ui ¥= 0, and let jo be the greatest index such 
that <f>j 0. Let R be the element of 72. that contains (io, jo)- Notice that if M 
is of the form (J3T3J) then the (i, j)-entries of M that are above left (resp. below 
right) of R do not contribute to the sum (|3.5p . because 4>i (resp. Uj) vanishes. That 
is, cf>{M ■ u) depends only on Mr and is given by j)<=n 4>j x ij u i\ Since AW is 



transitive, by Lemma 13.121 there is a choice of a matrix MeAof the form (|3.3[) so 



that tf)(M ■ u) ¥= 0. So we are done. □ 

3.4. Preorder in the complex plane. We consider the set C*/T of equivalence 
classes of the relation (|3.ip . Since T is the torsion subgroup of C*, the quotient 
C#/T is an abelian torsion-free group. Therefore it admits a multiplication-invariant 
total order <, by a result of Levi [LejF 2 ] 

Let [z] e C*/T denote the equivalence class of z e C#. Let us extend the 
notation, writing z < z 1 if [z] < [z']. Then < becomes a multiplication-invariant 
total preorder on C# that induces the equivalence relation ~. In other words, for 
all z, z', z" e C* we have: 

• z < z' or z' < z; 

• z ^ z' and z' < z <^^> z ~ z'; 

• z =^ z' and z' < z" ==> z < z"; 

• z < z' => zz" < z'z". 

It follows that: 

• z<z' => (z') _1 < 

We write z < z' when z < z' and z z' . 



3.5. Proof of the easy part of Theorem 13.7 



Proof of part [I] o/ Theorem \3.'A . If c(^4) = d then in particular all eigenvalues are 
different and so the matrix A is diagonalizable. So with a change of basis we can 
assume that A = Diag(Ai, . . . , A^). We can also assu me t hat the eigenvalues are 



increasing with respect to the preorder introduced in § |3.4 : 

Ai < A 2 < • • • < X d ■ 

Fix any matrix B with only nonzero entries, an d con sider the space A = sorbAd^ X 



which is described by (|2.6I) . We will use Lemma 13.111 to show that A is transitive. 
Let 72. be the partition of [1, d] 2 into lxl rectangles. Given a cell R = {(io, jo)} e 72 
and a coefficient t e C, there exists a polynomial / such that f(X i \J 1 ) equals t if 
XiXj 1 = A^A^ 1 and equals otherwise. Because the eigenvalues are ordered, 
M = f(Ad A ) • 73 is a matrix in A of the form Also, M R = (t). So A™ = C, 

which is transitive. This shows that rigAd^ = 1, and rig + Ad^ ^ 2. Thus, as 
d 5= 2, we have rig + Ad^ = 2. □ 

4. Proof of the hard part of the rigidity estimate 



This section is wholly devoted to prove part[5]of Theorem l3.71 . In the course of the 
proof we need to introduce some terminology and to establish several intermediate 
results. None of these are u sed i n the rest of the paper, apart form a simple 
consequence, which is Remark 14.161 . 



12 Lct us give a direct proof of the existence of an invariant order on <C#/T. There is an 
isomorphism between R © (R/Q) and <C#/T, namely (x, y) i— » exp(x + 2iriy). So it suffices to find 
an invariant order in R/Q (and then take the lexicographic order). Take a Hamel basis B of the 
Q-vector space R so that 1 e B. Then R/Q is a direct sum of abelian groups Q) xeB xQ. Order 
each xQ in the usual way, take any total order on B, and consider the induced lexicographic order 
on '. 
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4.1. The normal form. Let A e GL(rf, C). In order to describe the estimate on 
rig, Adyt, we need to put A in a certain normal form, which we now explain. 



We fix a preorder < on as in § \3A 



Let Ai, A r be the eigenvalues of A, listed without repetitions, and with 
respective multiplicities si, . . . , s r . Assume they are ordered: 

(4.1) A x <---<A r . 

Reindex the sequence of eigenvalues Ai, . . . , A r as 

Al,l — Al,2 — • • • — Al jri < A24 — X2.2 — ' ' ' — ^2,r 2 < ■ ■ ■ 

Write each eigenvalue in polar coordinates: 

Aj.j = Ti exp(6i.j\/—l), where T{ > and ^ 8ij < 2ir. 
Reorder the eigenvalues so that, for each i, 

8i,l < 9i,2 < ■ ■ ■ < 0i,n ■ 

With a change of basis, we can assume that A has modified Jordan form: 



(4.2) 



.4 



A, 



\ 



where tk,i + • • • + tk,r k = Sk and D t is the following t x t Jordan block: 

1/ 

The matrix A will be fixed from now on. 



(4.3) 



D, 



V 



4.2. Geography. This subsection contains several definitions which will be fun- 
damental in all arguments until the end of the section. We will define certain 
subregions of the set {1, . . . , d} 2 of matrix entry positions, which depend on the 
normal form of the matrix A. Later we will see they are related to Ad^-invariant 
subspaces. We will use "geographical" terms for those regions: islands, cities, and 
districts. The regions will have some numerical attributes (banner, area, popula- 
tion); these attributes may seem mysterious initially, but later we will relate them 
with numerical invariants of Ad,4 (eigenvalues, multiplicities, geometric multiplic- 
ities). We also introduce other attributes of the regions (northern and southern 
cities, latitude of a district) which will be useful later in the proofs of our rigidity 
estimates. 



Recall A is a matrix in normal form as explained in § [4JJ . Define three partitions 
'Pi, "Pc, Pd of the set [l,d] = {1, . . . , d} into intervals: 

• The partition V\ corresponds to equivalence classes of eigenvalues under the 
relation ~: the right endpoints of its atoms are the numbers s\ + • • ■ + Sk 
where k = r or k is such that Afe < Xk+i- 

• The partition V c corresponds to eigenvalues: the right endpoints of its atoms 
are the numbers s\ + • • ■ + s^, where 1 ^ k ^ r. So V c refines Pi. 

• The partition V& corresponds to Jordan blocks: the right endpoints of its 
atoms are the numbers s\ + ■ ■ ■ + Sfc_i + tk.i + ■ ■ • + tk/, where 1 ^ fc r 
and 1 ^ £ ^ Tk- So Vd refines V c . 
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For * = i, c, d, let V% be the partition of the square [1, d] 2 into rectangles that are 
products of atoms of V* . The elements of V 2 are called islands, the elements of V 2 
are called cities, and elements of V 2 are called districts. Thus the world W = [1, d] 2 
is a disjoint union of islands, each of them is a disjoint union of cities, each of them 
is a disjoint union of districts. 

Example 4.1. Suppose d = 17, A has r = 5 eigenvalues 

Ai = exp \ni, A2 = exp \^i, A3 = exp ^-Tri, A4 = 2exp A5 = 2exp |-7ri 

with respective Jordan blocks of sizes 4, 2, 1; 3, 2; 2; 2, 1. Then there are 4 islands, 25 
cities, and 64 districts. See Fig. [l|. 
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FIGURE 1. The geography corresponding to Example l4.ll . Thick (resp., thin, 
dashed) lines represent island (resp., city, district) borders. Population and 
latitude of each district inside a selected city are indicated. The population 
of each city is recorded in its upper left corner, along with a symbolic repre- 
sentation of its banner. There are three banner classes (• = [1], JJ- = [2] and 
ff = [1/2]), each of them with 3 different banners. Southern cities are marked 
with S. 

For each city (or district) we define its row eigenvalue and its column eigenvalue 
in the obvious way: If a city C equals Ik x h where Ik and It are intervals with 
right endpoints s\ + • ■ • + and si + ■ ■ ■ + S£, respectively, then the row eigenvalue 
of C is Afc and the column eigenvalue of C is A^. The row and column eigenvalues of 
a district D are defined respectively as the row and column eigenvalues of the city 
that contains D. 

Let C be a city with row eigenvalue A^j and column eigenvalue Xk,£- The banner 
of C is defined by The argument of the city is the quantity Qk,i — di.j e 

(— 2tt, 2tt). (It coincides, modulo 2ir, with the argument of the banner.) The city is 
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called southern within its island if it has strictly negative argument, and northern 
within its island otherwise. 

Each district D has an address of the type "« th row, j th column, city C" ; then 
the latitude of the district D within the city C is defined as j — i. See an example in 
Fig. i 

If two cities lie in the same island then their banners are equivalent mod T . Thus 
every island has a well-defined banner class in C*/T. 

If a district, city, or island intersects the diagonal {(1, 1), . . . , (d, d)} then we call 
it equatorial. Equatorial regions are always square. Thus every equatorial city has 
banner 1 and every city with banner 1 lies on a equatorial island. 

The area of a district, city, island or world is defined as the product of its sides. 
The population of a district is defined as the minimum of its sides. Populations of 
cities, islands and world are defined as the sum of the areas and populations of the 
corresponding districts. 

Let us noti ce so me facts on the location of the banners (which will be useful to 
apply Lemma 13. Ill ): 



Lemma 4.2. Let C be a cit 

and the island I as in Fig 



in an island I . Consider the divisions of the world W 
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FIGURE 2. The divisions of W and I in Lemma w7: 



Let (5 be the banner of the city C, and let 
Then: 



be the banner class of the island I . 



1. All the islands with banner class [/?] are inside the regions marked with x . 

2. If the city C is northern (resp. southern) within I then the all the northern 
(resp. southern) cities with same banner (3 are inside the regions marked 
with +. 

Proof. In view of the ordering of the eigenvalues (|4.ip . the banner class increases 
strictly (with respect to the order <, of course) when we move rightwards or upwards 
to another island. So Claim (TT]) follows. 

The argument of a city takes values in the interval (— 27r, 2n). It increases strictly 
by moving rightwards or upwards inside I. If two cities in the same island are both 
northern or both southern then they have the same banner if and only if they have 
the same argument. So Claim ^ follows. □ 

4.3. The adjoint in geographical terms. Given any d x d matrix X = (xij) 
and a district, city or island R = [p,p + t] x [q, q + s] we define the submatrix 
of X corresponding to R as (xi,j)u,j)<=R' We regard the space of R-submatrices as 
^({O}^ 1 x C* x {0} d - p -*,{0} 9 - 1 x C s x {0} d - q - s ), or as the set of d x d matrices 
whose entries outside R are all zero. Such spaces are denoted by R°, and are invariant 
under Ad^. Indeed, if R = D is a district then identifying D° with Matt xs (C), the 
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action of Ad^lD" is given by 



X 



XkXj'DtXD: 



where \kX e is the banner of D, and D t , D s are Jordan blocks defined by (|4.3[) . 

If R is an equatorial district, city, or island we will refer to the d x d-matrix in R" 
whose R-submatrix is the identity as the identity on R". The following observation 
will be useful: 

Lemma 4.3. IfD is an equatorial district then the identity on D" is a eigenvalue 
of the operator Ad^lD" corresponding to a Jordan block of size lxl. 

Proof. Suppose D has size t x t. Assume that the claim is false. This means that 
there exists a matrix X e Matt x t(C) such that D t XD^ 1 = X + Id, which is 
impossible because X and X + Id have different spectra. □ 

We are going to prove the following: 

Lemma 4.4. For each district D, the only eigenvalue of Ad^|D D is the banner of 
the city that contains D. Moreover, the geometric multiplicity of the eigenvalue is 
the population of the district. 

The following facts are immediate consequences: 

• The eigenvalues of Ad.A are the banners of cities. 

• The multiplicity of the eigenvalue (3 for the operator Ad,4 is the total area 
of cities of banner j3. 

• The geometric multiplicity of the eigenvalue (3 for Ad^ is the total population 
of cities of banner j3. 



Lemma 14.41 is equivalent to the following: 
Lemma 4.5. Let Ut lS be the linear operator on Matt x «(C) given by 

U t ,,(X) = D t XD-\ 

where D t , D s are Jordan blocks defined by (|4.3p . Then the only eigenvalue of Ut, s 
is 1, and its geometric multiplicity is min(t, s). 



The rest of this subsection is devoted to prove Lemma l4.5l . To begin, notice that: 

( 1 -1 (-l) 8 -^ 
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To describe Ut. s , it suffices to describe its action on the matrices E^j whose 
unique nonzero entry is a 1 in the position. Using (|4.4[) . we obtain 
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or, visually: 
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The picture above suggests a way of "embedding" all the maps Ut s into a single 
infinite-dimensional model. More precisely, consider the space M. of infinite ma- 
trices of the form X = {xk,t)k,i^o, where k, £ are non-positive integers, that have 
only finitely many non-zero entries. For each pair of positive integers t, s, define a 
monomorphism i t s : Mat txs (C) — > M. by 



if k > — t and I > —s, 
otherwise 



(h,j)i,j ' * {xk,i)k,e where x k ,i = 
Define a linear operator U : M. —* M. by 

i 

(x k ,i)k,£ ^ (yk,e)k,i where y k ,t = £j C -1 )' - '^^ + Zfe-1,9) 
Then the following diagram commutes: 

Mat txs (C) ^-+M 



u 



Mat txs (C) '^M 

Let us prove a few facts about the operator U. It is convenient to consider also 
N = U — id. 

If X = (xk,e) e M. and n = 1,2,..., then we define the n th diagonal of X as the 
n-tuple (x Q ._( n _i), x_i_i n -2)i ■ ■ ■ ,i-( n -i).o)' Define the height h(X) of X as if 
X = 0, otherwise is the maximal n such that X has a nonzero n th diagonal. 

It is clear that 

(4.5) h(N(X)) < h(X) if X ± 0. 

It follows that the operator ./V is nilpotent, in the sense that every orbit eventually 
hits zero. 

Lemma 4.6. Let X = (xk.e) e and Zei Z = (z k j) = N(X). If h(X) < n i/ien 
i/ie n" 1 diagonal of X can be determined from its first element and the [n — l) th 
diagonal of Z by the formula 

p-i 

CC_ Pi _(„_i_ p ) = I ,-(n-l) + XI Z -q~{n-2-q) , (p = 0, 1, . . . , 71 - 1). 

9=0 



Proof. It suffices to see that, for each g = 0,l,...,n — 2, 



Z -<?,-("-2-<j) — a; -((3+l),-(n-2-g) ^C— q,— (n— 1— g) 



□ 



For each t = 1, 2, . . . , let It e denote the image under t^t of the t x t identity 
matrix. A linear combination of matrices of this type is a matrix with constant 
diagonals and so will be called a Toeplitz matrix. 

Lemma 4.7. U(X) = X if and only if X is a Toeplitz matrix. 

Proof. Since the t x t identity is fixed by the Utj, we conclude that It is fixed by U, 
proving the "if" part. 

To see the conv erse, take X = (xk,e) in the kernel of N. Let n = h(X) be height 
of X. By Lemma |4.6| . the n th diagonal of X is constant, say (c, c, . . . , c). Thus 
X — cl n has height at most n — 1, and belongs to the kernel of N. It follows by 
induction in n that X is a Toeplitz matrix. □ 
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Proof of Lemma Since U — id is nilpotent, so is Ut, s ~ id, which means that 
the only eigenvalue of Ut, s is 1. 

The matrices I\, I2, • I m in(t,s) belong to the image of it >s ; therefore their 
inverse images are eigenvectors of Ut >s - The space V spanned by these eigenvectors 
is exactly 

{M e Mat txs (C); tt )S (M) is a Toeplitz matrix}. 

By Lemma l4?H , V is also the space of the eigenvectors of U± a . This proves that the 
geometric multiplicity of Ut, s is min(t, s). □ 

Remark 4.8. It is natural to ask what are the siz es o f the Jordan blocks corresponding to 
the eigenvectors exhibited in the proof of Lemma U .41 . We don't know the answer, except 
for the last eigenvector b7l(Im in(t s)), which corresponds to a 1 x 1 Jord an block. This 
fact, which generalizes Lemma k.3l . can be easily shown using Lemma k.d . 

4.4. Rigidity estimates for districts and cities. 

Lemma 4.9. For any district D, we have rig + (Ad J 4|D°) < popD. 



Proof. By Lemma l4.4l (and Proposition l3.3i ). Ad/i|D n has acyclicity n = popD, that 
is, there are matrices Xi, . . . , X n e D" such that sorbAd A (^i; ■ • • ,X n ) is the whole 
D° (and, in particular, is transitive in D°). So rig(Ad J 4|D°) < n, which proves the 
lemma for non-equatorial districts. 

If D is an equatorial district then, by Lemma |4.3| . D" splits invariantly into two 
subspaces, one of them spanned by the the identity matrix on D°. So we can choose 
the matrices Xi above so that X\ is the identity. This shows that rig + (Ad J 4|D°) < 
n. □ 

In all that follows, we adopt the convention max0 = 0. 

Lemma 4.10. For any city C, 

rig + (Ad j4 |C D ) V max rig + (Ad A |D D ) . 

1 D is a district of C 1 

I latitude with latitude I 

Proof. For each district D in C, let r(D) = rig + (Ady!i|D n ). Take matrices Xd,i, . . . , 
X Dj .( D ) such that A D := sorbAd A (^d,i, ■ • • , -Xi>,r(D)) is a transitive subspace of D°, 
and A D ,i is the identity matrix in D D if D is an equatorial district. Define Xj> j = 
for j > r(D). For each latitude £, let ne be the maximum of r(D) over the districts 
D of C with latitude £, and let 

D is a district of C 
with latitude £ 

Notice that if C is an equatorial city then Y"o,i is the identity matrix in C°. Consider 
the space 

A = sorbAd A \Xi,j'i is a latitude, 1 s$ j ^ nA. 
We claim that for every district D in C and for every M e A D , we can find some 
N e A with the following properties: 

• the submatrix iV D equals M; 

• for every district D' in D that has a different latitude than D, the submatrix 
Njy vanishes. 

Indeed, if M — Xlj!=i /j(Ad J 4)A Dj - for certain polynomials fj, we simply take N = 
2^=i fj(^A)Ye,j, where £ is the latitude of D. 

In notation (13.41) . the claim we have just proved means that AM dA d . So we can 
apply Lemma 1 3 . 1 ll and conclude that A is a transitive subspace of C°. Therefore 
rig + (Ad J 4|C°) < Yj n ii as we wanted to show. □ 
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Example 4.11. Using Lemmas 14.91 and l4.10l . we see that the city C whose district popu- 
lations are indicated in Fig.Q has rig + (AdA|C D ) ^ 5. 

In fact, we will not use Lemmas k.9l and 4.1dl directly, but only the following 
immediate consequence: 

Lemma 4.12. For every city C we have rig + (Ad^i|C D ) ^ popC. The inequality is 
strict if has more than one row of districts and more that one column of districts. 

4.5. Comparative demographics. If R is a district, city or island, we define its 
row projection tt t (R) as the unique equatorial district, city or island (respectively) 
that is in the same row as R. Analogously, we define the column projection 7r c (R). 

Lemma 4.13. For any city C, we have 

< pOp7T r (C) + pO P 7r c (C) 

Moreover, equality implies that the number of rows of districts for C equals the 
number of columns of districts. 

This is a clear consequence of the abstract lemma below, taking x a , a e Fa 
(resp. a e Fi) as the sequence of heights (resp. widths) of districts in C, counting 
repetitions. 

Lemma 4.14. Let F be a nonempty finite set, and let x a be positive numbers 
indexed by a e F. Take any partition F = Fq u F\. For e, S e {0, 1}, let 

T, eS = ^ vaia(x a ,xp). 

{a,j3)sF e xF 6 

Then 

v ^ - Eoo + En 
^01 — ^10 ^ ^ • 

Moreover, equality implies that Fq and F\ have the same cardinality. 

Proof. We will in fact prove the stronger fact: 

(4.6) £ 00 - 2£ 01 + £ n ^ (|F | - |Fx|) 2 min^ , 

where |-| denotes set cardinality. The proof is by induction on \F\. It clearly holds 
for \F\ = 1. Fix some n and assume that (|4. 6[) always holds when \F\ = n. Take a 
set F with \F\ = n + 1, and take positive numbers x a , a e F. We can assume that 
F = {1, . . . , n + 1} and that x± > • • • > x n+ \. Take any partition F = Fq u F±. 
Without loss of generality, assume that n+1 e F . Apply the induction hypothesis 
to F' = {1, . . . , n}, obtaining 

S' 00 -24 1 + E' u > (lFol-1-lFiDV- 

We have 

S o = E' 00 + (2|F | - l)x n+1 , E i = S' 0l + |_Fi|x n+ i , and En = E' u , 
so (JUnj follows. □ 

If R is an island or the world, let pop 1 R denote the banner 1 population on R, 
that is, the sum of the populations of the cities in R with bann er 1. 
Let us give the following useful consequence of Lemma 14.13 : 

Lemma 4.15. acycAd^ = popj W. 



24 



BOCHI AND GOURMELON 



Proof. By Proposition l3.3l acyc Ad a is the maximum of the geometric multiplicities 
of the eigenvalues of AcU. Those eigenvalues are the banners /?, and the geometric 
multiplicity of each /3 is the worldwide total population with banner j3. Thus, to 
prove the lemma we have to show that ban ner 1 has biggest worldwide population. 
Let ft be a banner. Then, using Lemma 4.131 



Yi P°P Cs S^ J] P°P 7r r(C)+i J] P0P7T C (C). 



2 M. w 2 

L is a city 
with banner /3 



Since no two cities in the same row (resp. column) can have the same banner, the 
restriction of 7r r (resp. 7r c ) to the set of cities with banner /? is a one-to-one map. 
This allows us to conclude. □ 

Remark 4.16. The Jordan type of a matrix A e Matdxd(C) consists on the following 
data: 

1. The number of different eigenvalues. 

2. For each eigenvalue, the number of Jordan blocks and their sizes. 

It follows from Lemma [4.15l that these data is sufficient to determine acyc Ad^ . (Of course, 
one can easily write down a formula; see e.g. [Gal p. 222] or |Arl p. 241].) 

4.6. Rigidity estimate for islands. 

Lemma 4.17. For any island I, 

n g+ (Ad A \r) < P°Pi^(l) + P0P l7 re(l) 

In order to prove this lemma, it is convenient to consider separately the cases of 
non-equatorial and equatorial islands. 

Proof of Lemma li.l'j when I is non- equatorial. For each banner j3 in I, let np (resp. 
sp) be the maximum of rig + (Ad J 4|C°) over the northern (resp. southern) cities C 
in I with banner /3. For each city C with banner /3, choose matrices A c .i, 
Xcnp+sf, e C° such that: 

• A c := sorbAd A (^c,i, ■ • ■ , A c , m ) is a transitive subspace of C°; 

• if C is southern then Xi = X2 = • • • = X n „ = 0; 

• if C is northern then X n/j+ i = • • • = X nfs+Sfj = 0. 

Also, let A c j- = for j > np + sp. 
Next, define 



(4.7) y w = 2 X 



C.J 



C is a city 
of I with banner /3 



and 

(4-8) Z S = J] F « 

p banner on I 

Consider the space 

A = sorDAcu(-^i; ■•• 7 Z m ), where m = max {np + sp) 

P banner on I 



ft follows from Lemma 13.21 that 

A = sorbAcU {Ypj; /? is a banner, 1 j < np + spj. 
Recall notation (|3.4j) . We claim that 
(4.9) AccAM. 

Indeed, given M e A c , write M = /j(AdA)A Cj -, where the fj's are polynomi- 
als and fj = whenever X c .j = 0. Consider N = ^ fj{AAA)Yp t j, where /3 is 
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the banner of C. Then it fo llows from Lemma (part [2]) that N e A^. This 
shows (|4.9[) . So, by Lemma 13 . 1 lL A is a transitive subspace of 1°, showing that 
rig + (AcU|l D ) < m. 

To complete the proof of the lemma in the non-equatorial case, we will show 
that 

(4.10) m< po Pl 7r r (l)+pop 1 7r c (l)_ 

Let (3 be the banner for which np + sp attains the maximum m. If np > 0, let 
Cjv be a northern city within I with banner j3 and rig + (Ad J 4|C^) = np. If sp > 0, 
let Cs be a southern city within I with banner (3 and rig + (Ad J 4|C^) = sp. Assume 
for the moment that both cities exist. Let Ci, C2, C3, C4 be projected equatorial 
cities as in Fig. 0. 




Figure 3. Cj = ^(c^), c 2 = 7r r (c s ), c 3 = vr c (c s ), c 4 = 7r c (Cjv). 



Then 



(i) 



m = rig + (AdA|CAr) + rig^Ad^lCs) ==; popCw + popCs 

(ii) 

< i(popCi H h popC 4 ) |(pop 1 Ii + pop x I 2 ), 

where (i) and (ii) follow respectively from Lemmas |4 . 1 2| and |4 . 1 3l . This proves (|4.10j) 
in this case. If there is no southern city or no northern city within I with banner 1 
then the proof of (|4.10[) is easier. 

So the lemma is proved for non-equatorial I. □ 

We now consider equatorial islands. There is an exceptional kind of island for 
which the proof of the rigidity estimate has to follow a different strategy. An island 
is called exotic if it has only the banners 1 and —1 (so it is equatorial and has 4 
cities), each city has a single district, and all districts have the same population. 

Proof of Lemma \i.l'i when I is equatorial non-exotic. As in the previous case, let 
np (resp. sp) be the maximum of rig + (Ad y i|C n ) over the northern (resp. southern) 
cities C in I with banner (3. 
We claim that 

(4.11) 71,3 + sp < pop x I for all banners /3 ^ 1 in I. 

Let us postpone the proof of this inequ ality and see how to conclude. 

Let M = pop x I. In view of Lemma |4.12| and relation (|4.11[) , for each island C 
we can take matrices A Cj i, . . . , A Cj m e C° such that: 
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• A c := sorbAd A (^c,i, • • • , Xc,m) is a transitive subspace of C D ; 

• X c ,m = if C is non-equatorial; 

• Xc,m is the identity in C" if C is equatorial. 

Then define matrices Zj as before: by (|4.7I) and (|4.8I) . Here we have that Zm is 
the identity matrix in 1°. As before, sorbAd A (Zi, • • • , Zm) is a transitive subspace 
of 1°. Hence rig + (Ad J 4|I°) ^ M = popj I, as desired. 

Now let us prove (|4.1ip . Consider a banner j3 ^ 1 in I. Let Cat (resp. Cg) be a 
northern (resp. southern) city within I with banner j3 and of maximal population; 
assume for the moment that both cities exist. Let Ci, C 2 , C3, C 4 be projected 
equatorial cities as in Fig. U 



Ci 






c 2 








c 3 




Cs 


C 4 



I 



Figure 4. Ci = 7r c (c s ), c 2 = ^(Cjv), c 3 = n (c N ), c 4 = 7r r (c s ). it is 

possible that Ci = C2 or C3 = C4. 



Then 

np + sp = rig + (Ad j4 |CAr) + rig + (AdA|C s ) 

(4.12) ^ pop Cat + popCs 

(4.13) <|(popCiH hpopC 4 ) 

(4.14) ^ pop 4 1. 

Inequality (I4.12[) follows from Lemma 4.12 , inequality (I4.13j) follows from Lemma 4.13L 



and inequality (|4.14l) holds because the cities Ci, . . . , C 4 are equatorial, and any 
city can appear at most twice in this list. So 

(4.15) np + sp < pop : I. 

In the case that there is no northern city or no southern city with banner j3 (i.e., 
np or sp vanishes), a simpler argument shows that strict inequality holds in (|4.15[) . 

Now assume by contradiction that (|4.1ip does not hold. Then we must have 
equality in (|4.15l) . By what we just saw, both cities Cat and Cs above exist. Then 
the inequalities in (|4.12p ~ (|4.14l) become equalities. Since (I4.14[) is an equality, there 
must be exactly two equatorial cities in I . So the non-equatorial banner /3 sat isfies 



P 1 = /3, that is, P = — 1. Since (|4.13[) is an equality, it follows from Lemma 14.13 



that both non-equatorial cities are district-square. So there is some £ such that all 
four cities in I h ave I rows of districts and I columns of districts. Since (|4.12[) is an 



equality, Lemma |4.12| implies that 1=1. That is, I is a exotic island, a si tuati on 



which we excluded a priori. This contradiction proves (j4.11[) and Lemma 14.171 in 



the present case. □ 

We now come to exotic islands. In all the previous cases, the transitive subspace 
we found had some vaguely Toeplitz form. For exotic islands, however, this strategy 
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is not efficiently What we are going to do is to find a transitive space of vaguely 
Hankel form, namely the following: 

(4.16) A fc = | ; M, N, P are k x k matrices 

Notice that A& = Sk ■ Tk, where 

Sk = ( and Tfe = | f ^ ; M, N, P are fc x k matricesj . 

Since Tk is a generalized Toeplitz space, it follows from Remark |2.3| that A& is 
transitive. 

Proof of Lemma \i.l'i when I is exotic. If I is exotic then it has size 2k x 2k for 
some k, and the operator Ad^ll" is given by X i— ► Adi(X), where 

L = , and D = Dk is the Jordan block (|4.3p . 

Let V be unique Ad^-invariant subspace of Mat^ x k (C) that h as co dimension 1 and 
does not contain the identity matrix (which exists by Lemma 14. 3h . Take matrices 
X\, . . . , Xk £ MatA;xfe(C) such that X\ = Id and V = sorbAd D {X 2l ■ ■ . , Xk). Define 
Yi, y fc eMat 2fe x2fe(C) by 



Then 



H«.H, A([A (r, n ) = ^ rfd + * J^): fier 



For j — k + 1, . . . , 2fc, define 



*9 





Xj-k ATj_fc 



Then, by Lemma I3.2I . 

sorb AdL (F fc+ i, . . . ,Y 2k ) = j ; AT, iV e Mat fcxfe (C 

Therefore sorbAd t (^ij ■ ■ ■ , ^2fc) is the transitive space given by (|4.16[) . Since Yi is 
the identity on I, th is shows that rig + (Ad/i|l n ) < 2fc = popj I, concluding the 
proof of Lemma l4.17l □ 

4.7. The final rigidity estimate. Let c = c(A) be the number of equivalence 
classes mod T of eigenvalues of A. 

Lemma 4.18. If c < d then 

rig + Ad^ ^ pop 1 W — c + 1 . 

Proof. Let m = popi W — c + 1. For each island I, let 

r(l) = [i(po Pl 7r r (l) +pop l7 r c (l))J. 

We claim that 

to if I is an equatorial island, 
to — 1 if I is a non-equatorial island. 



(4.17) r(I) < 



Let us postpone the proo f of this and see how to conclude the lemma. 

In view of Lemma 14.171 and relation (I4.17[) , for each island I we can take matrices 
Xi 1, . . . , Xj m e 1° such that: 



^For those who read Appendix 0, notice that the simplest exotic isl and a ppears when A has 
a type 3 constraint; we have dealt with them in the proof of Proposition I A . j 



2« 



BOCHI AND GOURMELON 



• Ai := sorbAd A (^i,i, • • ■ , -X"i,m) is a transitive subspace of 1°; 

• -X"i,m = if I is non-equatorial; 

• X ltTn is the identity in 1° if I is equatorial. 

Define matrices: 

y o ,j = y] Xij (a is a banner class, 1 < j ^ m), 

I is an island 
with banner class a 

Zj= 2 Ya >i 0-^3 <m). 

a is a banner class 

So Z m is the d x d identity matrix. Consider the space 

A = sorbAcu(^i> • • • > z m)- 
It follows from Lemma |3.2| that 

A = sorbAcU {Y a ,j\ a is a banner class, 1 < j < to}. 
We claim that every island I, 
(4.18) AjcAW. 

Indeed, if M e I then we can write M = ]F] • fj(AdA)Xi,ji where the /j's are 
polynomials. Consider N = £L /^(Ad^Y^j-, where a is the banner class of I. It 



follows Lemma PTa (part[TJ that N e AM. This proves (|4.18[) . So, by Lemma 13. Ill 
A is a transitive subspace of Matdxci(C), showing that rig + Ad a < Tn. 

To conclude the proof we have to show estimate (|4.17[) . First consider a equato- 
rial island I. Since there are c equatorial islands, and each of them has a positive 
banner 1 population, we conclude that r(I) to, as claimed. 

Now take a non-equatorial I. Applying what we just proved for the equatorial 
islands 7r r (l) and 7r c (I), we conclude that r(l) < m. Now assume that (|4.17D docs 
not hold for I, that is, r(T) = to. Then 

popj 7r r (l) = popj 7r c (I) = m = popj W — c + 1. 

Since popj W > popi vr r (l) + pop x 7r c (l) + c— 2, we have m = 1 and pop x W = c. This 
means that popj 1 = 1 for all equatorial islands I, which is only possible if c = d. 
However, this case was excluded by hyp othes is. 

This proves (|4.17p and hence Lemma \a .181 □ 



Example 4.19. If A is the matrix of Example |4.1| then Lemma |4.18| gives the estimate 
rig + Ad.A < 28. A more careful analy sis (going through the proofs of the lemmas) would 
give rig + AcIa < 7 (see Example k.llh . 



Proof of part fj of Theorem \3.% Apply Lemmas 14.151 and l4.18l □ 



5. Proof of the hard part of the codimension to theorem 



We showed in Proposition 12.111 that codim'Pm^ < m. In this section, we will 
prove the reverse ineq uali ties. More precisely, we will first prove Theorem ll .91 and 
then deduce Theorem 1.8 from it. 



5.1. Preliminaries on elementary algebraic geometry. 
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5.1.1. Quasiprojective varieties. An algebraic subset of C n is also called an affine 
variety. A projective variety is a subset of CP™ that can be expressed as the zero set 
of a family of homogeneous polynomials in n + 1 variables. The Zariski topology on 
an (affine or projective) variety A is the topology whose closed sets are the (affine 
or projective) subvarieties of A. 

An open subset U of a projective variety X is called a quasiprojective vari- 
ety. We consider in U the induced Zariski topology. The affine space C™ can be 
identified with a quasiprojective variety, namely its image under the embedding 
(zt, . . . , z„) i-> (1 : zi : ••• : z„). 

If X and y are quasi- projective varieties then the product X xY can be identified 
with a quasiprojective variety, namely its image under the Segre embedding; see 

m § 5.i]. 

The following is an important and very useful property of projective varieties. 
(See [SB p. 58] for a proof). 

Proposition 5.1. If X is a projective variety and Y is a quasiprojective variety 
then the projection p: X x Y — > Y takes Zariski closed sets to Zariski closed sets. 

A quasiprojective variety is called irreducible if it cannot be written as a non- 
trivial union of two quasiprojective varieties (that is, none contains the other). 

5.1.2. Dimension. The dimension dim A of an irreducible quasiprojective variety 
X may be defined in various equivalent ways (see for instance |Ha[ p. 133ff]). It will 
be sufficient for us to know that there exists an (intrinsically defined) subvariety Y 
of the singular points of X such that in a neighborhood of each point of X \ Y, the 
set X is a complex submanifold of dimension (in the classical sense of differential 
geometry) dim A; moreover, each irreducible component of Y has dimension strictly 
less than dimX. 

The dimension of a general quasiprojective variety is by definition the maximum 
of the dimensions of the irreducible components. 

Remark 5.2. The dimension of a quasiprojective variety U a CP™ coincides with the 
dimension of its Zariski-closure in CP™ (see [Hal p. 135]). 

The following lemma is useful to estimate the codimension of an algebraic set X 
from information about the fibers of a certain projection -k: X — > Yri 

Lemma 5.3. Let Y be a quasiprojective variety. Let X cz Y x CP 71 be a nonempty 
algebraically closed set. Let ir: X — > Y be the projection along CP™. Then: 

1. For each j > 0, the set 

C j = {V e Y ', codim7r _1 (y) < j} 
is algebraically closed in Y . 

2. The dimension of X is given in terms of the dimensions of the Cj 's by: 

(5.1) codimA = min (j + codimC,-) . 

The lemma is a consequence of standard theorems in algebraic geometry but for 
the reader's convenience let us spell out the details. 



Proof of Lemma \5.,% In what follows, all topologies are of course Zariski. We will 
prove the equivalent "dual form" of the lemma, namely, that the sets 

Y k = {yen(X); dim^y) > k} 

are algebraically closed in Y, and 

(5.2) dim A = max (k + dimYh). 

fc; Y k ^0 V ' 



14 



A slightly similar result is SW, Prop. 16] 
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First, the sets Xj~ = {x e X; d im7r~ 1 (7r(a;)) >j fc} are closed, (see |Ha[ 
Thrm. 11.12]). So, by Proposition l5.ll Y k = 7r(Xfc) is closed. 

For each k with A& 0, let -Xfc^ indicate the irreducible components of X k . Let 

/jb(k, i) = min dim7r — (7r(x)) . 

Then, by [Hal Thrm. 11.12] (and Remark [El) , 

dimXk^i = n(k,i) + dirm:(X kti ) . 

By definition, /i(fc,«) ^ fc; moreover equality holds unless Xj.^ cz X k +i- So 

Xfc,; c± X k+ i => dimX fe)i = fc + dim7r(X fc ^) < fc + dim F fe . 

Since X = [J Xfc x fc+1 this proves the < inequality in (|5.2p . 

To prove the converse inequality, fix any k with Yfe ^ 0. Find i such that 
dim7r(Xfc ! i) = dimYfc. Then 

dimX > dim X kt i = fJ-(k,i) + dimYfe > k + dim Y k . 

This proves (|5.2I) and hence the lemma. □ 

Remark 5.4. Lemma |5.3| works with the same statement if CP n is replaced by C" +1 , 
provided one assumes that X cz Y x C n+1 is homogeneous in the second factor (i.e., 
(y, z) e X implies [y, tz) E X for every teC). Indeed, this follows from the fact that the 
projection C n+1 s {0} — ► CP™ preserves codimension of homogeneous sets. 

5.1.3. Dimension estimates for sets of vector subspaces. If M e Mat„ x m (K) , let 
colM c K" denote the column space of M. A set X cz Mat„ xm (K) is called 
column-invariant if 

Mel 1 
iV 6 Mat nxm (K) I => NeX. 
col M = col TV J 

So a column-invariant set X is characterized by its set of column spaces. We enlarge 
the latter set by including also subspaces, thus defining: 

(5.3) \X\ := {E subspace of K"; E cz col M for some M e X}. 

Then we have: 

Theorem 5.5. Let X cz Mat nX m(C) be an algebraically closed, column-invariant 
set. Suppose E is a vector subspace of C n that does not belong to [X] . Then 

codim X ^ m + 1 — dim E . 



Thcorcm l5.5l follows without difficulty from intersection theory of the grassman- 
nians ("Schubert calculus"). The proof is given in BG1 03 

5.1.4. The real part of an algebraic set. Let X be an algebraically closed subset of 
C". The real part of X is defined as X n R". This is an algebraically closed subset 
of K n . Indeed, generators of the corresponding ideal fi, ■ ■ ■ , fk in C[Ti, . . . , T n ] can 
be replaced by the corresponding real and imaginary parts polynomials. 

As in the complex case, there are many equivalent definitions of dimensions of 
real algebraic or semialgebraic sets, in algebraic geometry. We just point out that 
a real algebraic or semialgebraic set admits a stratification into real manifolds such 
that the maximal differential geometry dimension of the strata coincides with the 
algebraic geometry dimension (see |BCRI p. 50]). 

Proposition 5.6. If X is an algebraically closed subset ofC n then dimR(A nK") =S 
dime X . 



^Theorem lS-^ has applications apart from the one given in the present paper. The proof given 
in IBG1I will be incorporated in |BG2I . 
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We refer the reader to |Wh| for the definition of the map rnk which to a real 
(resp. complex) variety V and to point p e V associates the real (resp. complex) 
rank of V at p. In the same paper, the author shows the two following results: 

• for any (real or complex) variety V, for any point p e V the rank rnk p (V) is 
greater than the codimension of V. 

• If V is real (resp. complex) there is a point p e V such that the real (resp. 
complex) rank satisfies rnk p (V") = codimR(V) (resp. rnk p (V) = codimc(V)). 

• Given a real variety V c R n , there is a unique smallest complex variety 
V* cz C" containing V (in particular, V is the real part of V*). Then we 
have rnk p (V"*) = rnkp(V). 



Proof of Provosition \5.a . Let V be the real variety X n R n . Let p e V such 
that rnkp(V) = codimR(l^). Consider the unique smallest complex variety then 
rnk p (V"*) = mkp(V). In particular codim R (V r *) > codim c (F*). Since V* cz X, 
the proposition follows. □ 

5.2. Rigidity and the dimension of the poor fibers. For simplicity of notation, 
let us write V m = V m ■ Also, for A e GL(cZ, C), write: 

r(A) := rig + Ad A - 1 • 
We decompose the set V m of poor data in fibers: 
(5.4) V m = (J {A}xV m (A), where V m (A) cz gl(d, C) m . 

AeGL(d,C) 

Lemma 5.7. For any Ae GL(d, C), the codimension ofV m (A) in Ql(d,C) m is at 
least m + 1 — r(A). 



The lemma follows easily from Theorem 15.51 above: 

Proof. Fix A e GL(cZ, C), and write r = r(A). We can assume that r < m, otherwise 
there is nothing to prove. By definition, there exists a r-dimensional subspace 
E cz Ql(d,C) m such that sorb A d A (Id v E) is transitive. Identify fl[(d,C) with C d2 
and thus regard V m {A) as a sub set o f Mat d 2 x m (C) . Since the set V m is algebraically 
closed and saturated (recall § 12.31 ). the fib er V rn (A) is algebraically closed and 
column-invariant, as required by T heor em l5.5l In the notation (|5.3[) . we have 
E $ fP m (A)J. So applying Theorem I5.5L the lemma is proved. □ 

5.3. How rare is high rigidity? For simplicity of notation, let us write: 

a{A) := acycAd A for A e GL(d,C). 

So Theorem Essays that r(A) < a(A) - c{A) provided c(A) < d. 

Lemma 5.8. For any integer k ^ 1, the set 

M k = {Ae GL(d, C); r(A) > k}; 

is algebraically closed in GL(d, C); moreover if / then 

ifk=l, 



codim Mh 



> k ifk>2. 



Lemma 15.81 is basically a consequence of Theorem 13.71 . using the following con- 
struction: 

Lemma 5.9. There is a family Q{A) of subsets of GL(e?, C), indexed by A e 
GL(g?, C), such that the following properties hold: 

• Each Q(A) contains A. 

• Each G(A) is an immersed manifold of codimension a{A) — c(A). 
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There are only countably many different sets G(A). 



The informal proof of the lemma goes as follows: For each A e GL(d, C), let 
G(A) be the set of matrices that have the same Jordan type as A (as defined in 
Remark l4.16t , and (at least) the same mod T relations betw een the eigenvalues. 
Then G(A) contains the conjugacy class of A, which by Rcmark l3.4l has codimension 
a(A). We can also move the eigenvalues (keeping the mod T relations); this gives 
c(A) extra degrees of freedom, so the codimension of G(A) is a(A) — c(A). Since 
there are only finitely many Jordan types of d x d matrices, and only countably 
many mod T rel ation s, there are only countably many different sets G(A). A formal 
proof of Lemma IBlil follows: 



Proof. First suppose that A e GL(d, C) is a matrix in Jordan form 
/# tl (Ai) \ f X 1 \ 



A 



where B t (X) : = 



V 



Let c = c(A) ; by the definition 
fix, . . . ,n c e C* , 
9ifiki for each i 



V 



1 

A/ 



eMat txt (C). 



such that A 
such that 

(5.5) 
(5.6) 

Define a map $ : U - 



i. 



, we can choose numbers 
.,9 n eT, ki,...,k n e {l,...,c} 
. . . , n. Let U be the subset of (yi, . 



,2/ c )eC c 



yk for each k, 
i ■■ j ■■■ f >,!/,.. Q jVk . 

GL(d, C) by: 

'B ni {Q x y k A 

,Vc) 



B nk (0 n y kn )J 



For every y £ U, conditio n (15.61) assures that $(y) has the same Jordan type as A, 
and therefore, by Remark 1 4.161 . a($(z/)) = a(A). 

We define the set G(A) as the image of the map \& = ^a- GL(d, C) x U — » 
GL(d,C) given by 9(X,y) = M x (*(y)). 

Let us check that property 15.91 holds. Let di^ and d% ^ denote the pa rtial 
derivatives with respect to X and y 7 respectively. As we have seen in Remark l3.4l 
the rank of (5i v E'(A, y) is equal to d 2 — a($(y)) = d 2 — a(A) for every (X,y). We 
claim that 



(5.7) 



(d 2 ^>(X,y))- 1 (image of d^(X,y)) = {0}; 



To see this, consider the map T: Matdxd(C) — * C d that associates to each matrix 
the coefficients of its characteristic polynomial. Then di(T o ty)(X,y) = 0, while 
52(ro\I>)(0, 0) is one-to-one. So (I5.7[) follows. As a result, the rank of the derivative 
of \& is equal to d 2 — a(A) + c(A) at every point. Therefore, by the Rank Theorem, 
the image of \& is an immersed manifold of codimension a(A) — c(A). 

For arbitrary A e GL(d, C), we define G(A) = G(Aq), where A$ is the Jordan 
form of A. Each set G(A) depends only on the data n and (i,-, 9i, fei)*=i,.",ni therefore 
there are only countably many different sets G(A). □ 



Remark 5.10. It is not difficult to show that each Q(A) is a actually a submanifold of 
GL(d, C), but we won't need this. 
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Proof of Lemma \5A . If k = 1 then Mi = GL(d, C) (since d > 2), so there is nothing 
to prove. Consider fc > 2. We have already shown in § 12.31 that Vu is algebraic. 
Since M k = {A e GL(d,C); VX e 0l(d,C) fc , e P fe }, it is evident that M k is 

algebraically closed as well. We are left to estimate its dimension. 

Take a nonsingular point Ao of M k where the local dimension is maximal. Let 
D be the intersection of M k with a small neighborhood of Aq; it is an e mbedded 
disk. Each A e D has r(A) > 2; therefore by (both parts of) T heor em 13.71 we have 
a(A) — c(A) 5= r(A) ^ k. So, in terms of the sets from Lemma [HT 



D 



U 



5(1)- 



A s.t. a(A)-c(A)^k 



The right hand side is a countable union of immersed manifolds of codimension at 
least k. It follows (e.g. by Baire Theorem) that D (and hence Mk) has codimension 
at least k. □ 



5.4. Proof of Theorem ll.9l . Now we apply Lemmas 15.71 and 15.81 to prove one of 
our major results: 



Proof of Theorem ] 1 A . The s et V m c GL(d, C) x [gl(d, C]l m is homogeneous in the 
second factor. Using Lemma [5^ together with Remark 1 5. 4 . we obtain that the sets 

(5.8) Cj = {A e GL(d,C); codim7> m (A) < j} 

are algebraically closed in GL(d, C), and 



codim Vn 



min (i + codim C,- , 



By Lemma 15.71 . we have Cj c M m+ i_o. Therefore, by Lemma 15. 8 



(5.9) 



codim d 



5* 



if j = m, 



^ m — j + 1 if j < m — 1. 



So codim V m > m, as we wanted to show. 



□ 



The proof above only used that codim Cj 5= m — j. On the other hand, using 
the full power of (|5.9[) we obtain: 

Scholium 5.11. The set of poor data in "fat fibers", namely 

T m := {(A, Bi, ... , B m ) e codim V m {A) < m - l}, 

/ias codimension at least m + 1 m GL(rf, C) x [g[(rf, C)] m . 



Proof. The projection of .F m on GL(d, C) is C m _i. Use Lemma l5.3l (together with 
Remark [13) and ([5J)l . □ 



5.5. The real case. 



Proof of Th eore m li.il . The real part of Vm^ 1 is a real alge braic set which, in view of 



Proposition l5.6l . has codimension at least m. Recall from § l2.3l that this set contains 
the semialgebraic set Vm \ which therefore has codimension at least m. Since we 

I 1 (M.) 

already knew from Proposition ^. Ill that V m ^ m, the theorem is proved. □ 
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5.6. Additional information. Let us im prove upon Scholium l5.1ll and so prepare 



the ground for the proof of Theorem ll.2l . This part is not necessary for the proof 
of Theorem ll.ll . 



Recall from § 12.31 the definition of saturated set. 

Lemma 5.12. There exists a saturated algebraically closed set S m cz GL(d, C) x 
[Matdx d(C)] m of codimension at least m + 1 such that for all (A, B\, ... , B m ) e 
V m \ S m , the following properties hold: 

1. A is unconstrained; 

2. if P e GL(d, C) is such that P~ 1 AP is a diagonal matrix then there are 
indices io, jo e {1, . . . , d} with io ¥= jo such that for each k e {1, . . . , m}, the 
(*0;jo) entry of the matrix P" 1 B^P vanishes; 

3. for each choice of P above, the off-diagonal vanishing entry position (io,jo) 
is unique. 

Notice that each d ata in V m \ S m , after a change of basis, satisfies precisely the 



hypotheses of Lemma 12.14 . 



In order to prove the lemma, we begin by checking algebraicity of the constraints: 

Lemma 5.13. The set K cz GL(d, C) of constrained matrices is an algebraically 
closed subset of codimension 1. 

Proof. Multiply all constraints, obtaining a polynomial in the variables Ai, 
Xd- This polynomial is symmetric, and therefore (see e.g. [La . Thrm. IV. 6.1]) can 
be written as a polynomial function of the elementary symmetric polynomials in 
the variables Ai, . . . , Xd- Now substitute each elementary symmetric polynomial 
in this expression by the corresponding coefficient of the characteristic polynomial 
of the matrix A. This gives a polynomial function on the entries of the matrix A 
that vanishes if and only if A is constrained. It is obvious that the corresponding 
algebraic set K has codimension 1. □ 

Now we check algebraicity of double vanishing: 

Lemma 5.14. There exists a saturated algebraically closed subset T> of &L(d, C) x 
[Matrix ri(C)] m such tha t if ( A, B\, . . . , B m ) e T> and A has simple spectrum then 



property^ from Lemma \5. IB is satisfied, but property^ is not. 

Proof. First, consider the subset X cz [Matrixd(C)] 1+m x (CP^ 1 ) 2 formed by tuples 
(A, Bi, . . . , B m , [v], [w]) such that 

[Av] = [v], [A*w] = [w], w*v = 0, w*B k v = for each k = l,...,m, 

where v and w are regarded as column-vectors and th e st ar denotes transposition. 



The set X is obviously algebraic; thus, by Proposition |5.1| . so is its projection Y on 
[Matrix d(C)] 1+m . 

Let A be a matrix with simple spe ctrum . Then (A, B\, ... , B m ) belongs to Y 
if and only if property [2] from Lemma 5.121 is satisfied. In particular, the fiber of 



Y over A is a union of affine subspaces of [Matrix d(C)] m . Intersections of those 
affinc spaces correspond to points where the uniqueness property [3] is not satisfied. 
These points of intersection are singular points of Y. Conversely, it is clear that the 
variety Y is smooth at the points on the fiber over A where property [3] is satisfied. 

So let Z be the (algebraically closed) set of singular points of Y. It is straight- 
forward to see that the set Y is saturated. Recalling Remark 12.81 (part [T]) and the 
fact that a group acting on a variety preserves singular points, we see that the set 
Z is saturated as well. 

We define T> as the set Z minus the tuples (A, B\ , . . . , B m ) with det A = 0. Then 

V has all the required properties. □ 



UNIVERSAL REGULAR CONTROL 



35 



Proof of Lemma \5.1& . For simplicity of writing we will omit the m subscripts. 
Let it : *P — ► GL(d, C) be the projection on the first matrix. Define 

S = tt- x (K)u (V nV), 

where K and T> come respectively from Lemmas 15.131 and 5.14 . Then S is a satu- 



rated algebraically closed subset of V . If A = (A, B\, ... , B m ) e V \ S then: 
• A £ K, which is property [T] 



• since A e V, it follows from Lemma 12.131 that A is conspicuously poor, and 
so property [2] holds; 

• since A T>, property [3] also holds. 

To complete the proof of the lemma, we need to show that codiniiS > m + 1. 
We will use the following inclusion: 

(5.10) 5c Ju {?~ 1 {K) \ J 7 ) u ((£> nP)\ ^(K)) . 

v v ' v v ' 

where T comes from Scholium 15. Ill Recall that T equals 7r _1 (C TO _i), where Cj is 



given by (|5.8[) . and it ha s codimensio n at least m + 1. 



We apply Lemma 15.31 and Remark |5.4| to the set T' cz Y' x [gl(d, C)] m , where 



Y' = GL(d, C) s C m -i- Since K has codimension at least 1 in Y', and the fibers 
of J- 1 all have codimension at least rn^we conclude that that codim T 1 > m + 1. 



Next, we want to apply Lemma 15.31 and Remark 15.41 to the set J-" cz Y" x 



[fl[(d, C)] m , where Y" = GL(d, C)\K. For each A e Y", it follows from Lemmal5T4 



that the fiber of T" over A (which is the same as the fiber of T> over A) has 
codimension 2m in [g[(d, C)] m , corresponding to the 2m different matrix entries 
that must vanish. We conclude that codim J 7 " > 2m. 

We have seen that each of the three sets on the right-hand side of (|5.10p has 
codimension at least m + 1. So the same is true for S, as we wanted to prove. □ 

6. Proof of the main results 

6.1. Stratifications. We first recall a few notions about stratifications. We refer 
the reader to [GWPL, Ma for details and proofs. 

Let S be a closed subset of a smooth (i.e., C 00 ) manifold X. A smooth stratifi- 
cation of S is a filtration by closed subsets 

S = S n ZJ S n _l ZD ■ ■ ■ ZD Y,Q 

such that and for each i, the set Xj = Sj \ (where E_i := 0) cither is a 

smooth submanifold of M without boundary and of dimension i, or is empty. Each 
connected component of Xi is called a stratum. The codimension of a stratified 
space is the lowest codimension of strata. This does not depend on the choice of 
the stratification. 

Note that, apart for discrete subsets £ cz X, if there is one smooth stratification, 
then there are infinitely many others. However, the subsets that we will be dealing 
with will be endowed with certain canonical stratifications: 

Theorem 6.1 (Existence of canonical stratifications). Any algebraic set E cz C N 
admits a canonical smooth stratification, whose strata are complex submanifolds of 
C N . Any closed semialgebraic set S cz ~R N admits a canonical smooth stratification, 
whose strata are semialgebraic submanifolds of R w . 

In the case of an irreducible algebraic set E cz C", the canonical stratification 
can be obtained as follows: The connected components of the set of regular (i.e., 
non-singular) points form the higher-dimensional strata; then one decomposes the 
set of singular points of E into irreducible components and proceeds by induction. 
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In any case, those canonical stratifications are uniquely characterized by a certain 
minimality property. In particular, the canonical stratifications are equivariant 
under polynomial automorphisms of the ambient space. 

Another important property of the canonical stratifications is that they sat- 
isfy the so-called Whitney conditions. We will not recall here those conditions, 
which would be rather technical; we will only write down some of their properties. 
A smooth stratification that satisfies the Whitney conditions is called a Whitney 
stratification. 

Proposition 6.2 (Basic properties of Whitney stratifications). Let X , Y be smooth 
manifolds. Let 

(6.1) E„ => • • • => £ 

be a filtration of a set £ cz X . Then: 

1. Being a Whitney stratification is a local property of a filtration: So if (|6.1|) 
is a Whitney stratification of £ then £„ n U zd ■ ■ ■ zd £ n U is a Whitney 
stratification o/ £ n £/, and conversely if each point in £ has an open neigh- 
borhood U cz X such that S„ n U zd ■ ■ ■ zd £ n U is a Whitney stratification 
of £ n U then (|6.ip is a Whitney stratification of £ . 

2. // (|6.1I) is a Whitney stratification o/£ then £„ x Y zz> ■ ■ ■ zz> So x Y is is a 
Whitney stratification ofExYczXxY. 

3. // (|6.1[) is a Whitney stratification of £ and f:X—>Y is a smooth diffeo- 
morphism then /(£ n ) zd ■ ■ ■ zd /(£o) is a Whitney stratification o//(£) cz Y . 

Let us now discuss how stratifications behave with respect to transversality. 
Let / : X —> Y be a C 1 map. Let E = E,; d • ■ o S be a stratification of a 
closed subset £ of Y. One says that / is transverse to that stratification if it is 
transverse to each of its strata. Transversality to a general stratification is not an 
open condition. However, we obtain openness if the stratification is Whitney: 

Proposition 6.3 (Transversality is open). Let X, Y be C 00 manifolds without 
boundary. Let £ = £<j zd ■ ■ ■ zd £ be a Whitney stratification of a closed subset of 
Y. Then the set O = {/ e C^XjY); /SE} is open in C^X.F) (with respect to 
the strong topology). 

Actually, only the first of the Whitney conditions is necessary here (use the 
(1)=>(3) implication of Trotman's theorem |Tr]). 

6.2. Jets and jet transversality. We recall the basic notions on jets and state 
the transversality theorems we will need; see [HI] for details. 

Let X, Y be smooth manifolds without boundary. If 1 ^ r < go, an r-jet from 
X to Y is an equivalence class of pairs (x, /), where x e X, f is a C r map from a 
neighborhood of x to Y, and where (x, f) is equivalent to (x' , /') if x = x' and / 
and /' have same derivatives at x up to order r. We denote by J r (X, Y) the space 
of r-jets from X to Y. It is a smooth manifold. 

For all 1 s ^ go, we denote by C S (X, Y) the space of C s -maps from X to Y, 
endowed with the strong topology. 

Given 1 ^ r < s ^ go and a map g e C S (X, Y), the r-jet extension is the map 
j r g: X —* J r (X, Y) that sends x to the equivalence class j r g(x) of (x,g). Then the 
mapping 

f : C S (X, Y) — ► C s - r (A, J r (A, Y)) 

is continuous. 

Theorem 6.4 (Jet transversality). Let 1 r < s ^ go. Let X and Y be C 00 mani- 
folds without boundary. Let W <zz J r (A, Y) be a C 00 submanifold without boundary. 
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Then the C s -maps g: X — > Y for which the r-jet extension j r g is transverse to W 
form a residual subset of C s (X,Y) . 

Let us now show the following: 

Proposition 6.5. Let X , Y be C 00 -manifolds without boundary. Let £ c Y) 
be a Whitney stratified closed subset. Then {/ e C 2 (X,Y); j f rK £} is C 2 -open 
and C 00 -dense in C 2 (X,Y). 

Here, as in the introduction, we say that a subset of C 2 (X, Y) is C^-dense if its 
intersection with C r (X, Y) is C-dense, for every r 5= 2. 

Proof of Proposition\KE By Proposition EJ2 the set {F: X — » ^(X^); FSEj 
is open in C 1 (X,J 1 (X,Y)). Hence the set O := {/: X -> F; j 1 /^ £} is open in 

c 2 (x, y). 

Fix r ^ 2. Given a Whitney stratification £„ 3 • • • 3 Eo of S, let Zi = 
Sj \ Sj_i be the corresponding dec omp osition into smooth submanifolds. By the 
jet transversality theorem (Theorem [^J) , each set TZi = {/ e C r (X, Y)\ j 1 f rK Z{\ 
is residual. Thus n C r {X, Y) = f). 7?.^ is C r -dense. This concludes the proof. □ 



6.3. Proof of the main result. We now use Theorem 1 1 . 8l and the tools explained 
above to prove our main result. Before going into the proof itself, let us deal with 
a technical detail. 
By Theorem 



1.8 . Vm is a closed semialgebraic subset of GL(e?, R) x gl(<i, R) r 
6.1 



Since Theorem l6.ll concerns semialgebraic subsets of affine space, we proceed as 
follows. First, enlarge Vm by including all [A, B\ , . . . , B m ) with det A = 0, thus 
obtainin g a s ubset T of [Matdxd(K)] 1+m which is also closed and semialgebraic. By 
Theorem l6.lL the set T admits a canonical Whitney stratification 

f = f n =) • • • => f o . 

Now we remove all (A, B \ , . . . , B m ) with det A = from each T^, thus (by locality 
property [1] in Proposition obtaining a Whitney stratification of codimension m: 

(6.2) V& = T n 3 ■ ■ ■ 3 r . 

(We may have r„ = r„_i.) Since the stratification of T is canonical, the strati- 
fication (|6.2[) is invariant under polynomial automorphisms of the set GL(ci, M) x 
0[(d,M) m that preserve vffl. 

Proof of Theorem Let U be a smooth manifold without boundary and of di- 
mension to. Given local coordinates on an open set U c U, the set of 1-jets from 
U to GL(d, R) may be identified with the set 

U x GL(d, M) x fl [(d,M) m . 

Indeed, a jet J represented by a pair (u, A) can be identified with the point 

(u,A(u),B 1 ,...,B m ) e U x GL(d,R) x fl [((i,M) m , 

where Bi e Matdxd(K) is the normalized derivative of A at u, along the i th coordi- 
nate. Let us say that the 1-jet J is rich if the data A = (A(u), B\ 1 . . . , B m ) is rich, 
or equivalently, if for sufficiently large N, the input (u, . . . , u) £ U N is universally 
regular for the system (|1.4[) . If the jet is not rich then it is called poor. 
Define a filtration 

(6.3) E„3..oE 

of the set of poor jets from U to GL(rf, R) as follows: a jet J represented as 
above in local coordinates by (u, A(u), Bi, . . . , B m ) belongs to Sj if and only if 
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(A(u),Bi, . . . , B m ) belongs to the set I\ in (|6.2[) . We need to check that this defi- 
nition does not depend on the choice of the local coordinates. Indeed, this follows 



from the fact that Vm is a saturated set (see § 12.31 ) using the invariance property 
of the stratification (|6.2I) explained above. 

We claim that the filtration (|6.3|) is a Whitney stratification of codimension m. 
Indeed, the intersection of the filtration with the open subset J 1 (C/, GL(d, K)) of 
J 1 (W, GL(d, K)) is identified (through a smooth diffeomorphism) with the filtration 

u x r„ ~> ■■■ => u x r . 



So the claim follows from Proposition 16.2 



Applying Proposition l6.5l we obtain a C 2 -open C°°-dense set O c C 2 (U, GL(d, C)) 
formed by maps A that are transverse to the stratification (|6.3p of the set of poor 
jets. Since the codimension of the stratification equals the dimension of the man- 
ifold U, if A e O then the poi nts u for which j 1 A(u) is poor form a O-dimensional 



set. This proves Theorem ll.il . □ 



/TO -1 

Remark 6.6. In the proof above, instead of working with the semialgebraic set Vm. , 
we could have worked equally well with the real part of Vm , since it is an algebraic set 
containing Vm and has the same codimension. 

6.4. Proof of the addendum. 



Proof of Theorem Consider the set sffl given by Lemma I5.12L and let Sm 1 
be its real part. This is an alge brai cally closed saturated subset of GL(d, R) x 
[gl(d, M)] m which, by Proposition I5.6L has codimension at least m+ 1. 

Consider the set T of 1-jets J e J 1 (U,GL(d,C)) that have a local expression 
(u, A(u),Bx, . . . , B m ) with (A(u), Bx, . . . , B m ) e Sm - This does not depend on the 
choice of the local coordinat es, b ecause Sm ^ is saturated. By the same arguments 
as in the proof of Theorem ll.il . the set L admits a Whitney stratification. Its 
codimension is at least m + 1. Applying Proposition 16.51 we obtain a C 2 -open 
C^-dense set O c C 2 (U, GL(d, C)) formed by maps A that are transverse to the 
stratification. 

Let O be the set provided by Theorem ll.il . and consider a map A e OnO . Then 
whenever a jet j 1 A(u) is poor, it does not belong to T. Recalling Lemma l5.12l we see 
that the lo cal ex pression of ^Aiu) satisfies (after a change of basis) the hypotheses 
of Lemma l2.l4 Therefore parts Q] and [5] of the theorem follow respectively from 
conclusions [T] and [5] of the lemma. □ 

Remark 6.7. The proof of Theorem [l^ also gives more information about the 1-jets that 
appear generically for sin gula r constant inputs (u,...,u): the associated matr ix data is 
conspicuously poor (see § 12.41 ). and the matrix A(u) is unconstrained (see § 12.51 ). 

Remark 6.8. Properties Q] and [2] in Theorem ll.2l are in fact dual to each other. If A 
is the data representing the 1-jet of A at u, and A = A(A), then property [T] means that 
there is an unique direction [v] e RP d_1 such that A • v ^ C d . Then property [2] means 
that there is an unique direction [w] e RP d_1 such that A* ■ w ^ C d , where A is the 
set of the transposes of th e mat rices in A. This fact can be proved easily using the dual 
characterization of Lemma [3.121 . 

Appendix A. The case of one-dimensional input 

As we explained in this appendix contains a basically independent dis- 

cussio n of the case dimW = 1. The prerequisites are all contained in Section 



and § l3.ll . In order to avoid technicalities at this point, we will be sometimes 



informal, especially regarding questions of transversality. 
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Let us define the canonical constraints respectively of type 1, 2, 3, 4 as the 
following relations: 



(A.l) AiA 3 
Recall from 



A 



A1A4 — A2A3, 



Ai 



-A, 



2.5l that a constraint between variables Ai , 



Ai — A2 . 

, Xd is a relation that 
can be reduced to one of the four canonical constraints after a change of indices. 
Each constraint has a unique type. 

Let us say that a matrix A e GL(rf,R) is (i)- constrained, for 1 < i < 4 if: 

• its eigenvalues, counted with multiplicity, satisfy exactly one elementary 
constraint, which is a type i constraint, 

• if there is a type 4 constraint between the eigenvalues, then the matrix A is 
not diagonalizable. 

Hence if a matrix A is not (i)-constrained for any < i < 4, then 

• either A is unconstrained, i.e., its eigenvalues (with multiplicity) satisfy no 
constraint; 

• or the eigenvalues of A satisfy at least two constraints; 

• or A has a (multiple) eigenvalue corresponding to at least two Jordan blocks. 
If either of the last two cases hold, we say that A is multiconstrained. 

Proposition A.l. 1. The complement of the set of unconstrained matrices has 
codimension 1 in GL(d, M). 
2. The set of multiconstrained matrices has codimension 2 in GL(d, R). 

Informal proof. Matrices that are not unconstrained have at least one constraint 
on their eigenvalues, so the corresponding set has codimension 1. 

Matrices that are very constrained either have at least two constraints on their 
eigenvalues, or have an eigenvalue of multiplicity 2 and are diagonalizable. In both 
cases, the corresponding set has codimension 2. □ 

Let us define adapted basis for matrices A that are not multiconstrained: 

• If A is unconstrained then an adapted basis is a basis of eigenvectors. 

• If A is (i)-constrained, for i = 1, 2, or 3 then an adapted basis is an (ordered) 
basis of eigenvectors such that the corresponding eigenvectors Ai, . . . Xd sat- 
isfy the canonical type i constraint. 

• If A is (4)-constrained then an adapted basis for A is a basis in which A is 
written in the following modified Jordan forrrf^l: 







Ai 
Ai 



\ 



Obviously, such adapted basis always exist. 

If a matrix A is (i)-constrained then we say that adxd matrix B is a good match 
for A, if there is an adapted basis for A in which it writes as B = (6y), where all 
nondiagonal entries bij are nonzero and if 611 ^ 622, in the particular case where A 
is 3-constrained. 

The use fulness of this definition is explained by the following Propositions I A. 2| 
and lA.j 1 " 



^The reason for using a modified Jordan form is that it makes the expression of Ad^ simpler, 
as we will see later. 

^Actually, the definition of a good match matrix is stronger than necessary for Proposition ! A. j 
to be true. But in order to avoid complications, we chose a condition that works for all types of 
constraints. 
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Proposition A. 2. If A is not multiconstrained and B is a good match for A then 
the pair (A, B) is rich. 

(C) 

In other words, V{ is contained in the following set: 

(A. 2) £ := {(A,B) e GL(d, C) x g[(d, C); either A is multiconstrained 

or A is not multiconstrained but B is not a good match for A}. 

Proposition A. 3. 1. The set £ has codimension 1. 

2. The set {(A, B) e £ ; A is not unconstrained} has codimension 2. 

Informal proof. This follows from Proposition lA.il and the fact that for each matrix 
A that is not multiconstrained, the set of B's that are not good matches for A has 
positive codimension in gl(d, C). □ 

Theorem in the case m = 1 follows f r om the prop ositions above. Therefore 
the other main results (Theorems O, Gil Q and [cj) in the m = 1 case also 



follow from the propositions. For any of these results, the propositions give extra 
information of practical value: with the explicit d efin ition of the set £ in (| A.2|) . we 



know which 1-jets should be avoided in Theorem ll.il . for example. The discussion 
given in Appendix [b| also applies; it gives explicit conditions on the 2-jet exten- 
sion of th e m ap A : U — » GL(d, R) that assure that A satisfies the conclusions of 
Theorems O and O- 



Proof of Provosition \AA . Let A and B satisfy the hypotheses. We need to show 
that the space A(A, B) defined by (|2.2p is a transitive subspace of gl(d, C). Let 
F := sorb A d A (-B), so that A(A, B) = {Id} v F. 

The matrix A is not multiconstrined and so has an adapted basis as above. We 
change the basis so that A and B are "canonical" . 

The proof is divided in cases according to the type of constraint. Except for the 
(4)-constrained case, the matrix A is diagonal, and so the space T is described by 

(USD. 

Unconstrained case: It follows from Lemma 12.13 that if A is unconstrained and 



diagonal then the only way for the pair (A, B) to be poor is that B h as an off- 
diagonal zero entry. (The reader should review the proof of Lemma 12.131 .) 

(1) -constrained case: We see that the adjoint has two eigenvalues (different from 
1) of multiplicity 2, namely AiA^ = A2A^ and A2A^ = X^X^ ■ By the same 
reasoning as in the unconstrained case, it follows that {Id} v T contains the space 

{{yij) e gi( rf > c ); yn = • • • = Vdd , b^yi2 = b^y 2 y, , kfiW = 632^32}- 



This is a generalized Toeplitz space, and so by Example 12.21 it is transitive. 

(2) -constrained case: The reasoning is very similar to that of the (l)-constrained 
case, but now the adjoint has four eigenvalues (different from 1) of multiplicity 2. 
The space A(A, B) contains the following subspace: 

{{yij) e l ( d > C ); 2/11 = ' ' ' = Vdd , K31J13 = &2~42/24 , 

bizVlZ = , ^21^21 = &I3V3 , &3~1 2/31 = &34W}- 

Again, this is a generalized Toeplitz space, and so it is transitive. 

(3) -constrained case: This case is a little different from the two previous ones. 
The adjoint has an eigenvalue —1 of multiplicity 2. Recalling that 611 and 622 are 
different, and making use of the identity matrix, we see that A(A,£?) contains the 
following subspace: 

f = {(yij) e fll( d > c ); 2/33 = • • • = Vdd , b^y l2 = b^y 2 i}- 



UNIVERSAL REGULAR CONTROL 



41 



This is not a generalized Toeplitz space. However, consider the linear automorphism 
S that swaps the first two elements of the canonical basis of C", and fixes the others. 
Then 

S ■ f = {(Zij) e gl(d, C); z 33 = • • • = z dd , b^zaa = b 2 i z n} 



is a generalized Toeplitz space! By Remark l2.3l the space S ■ T is transitive, and so 
are f and A(A,B). 

(A) -constrained case: This case is more involved because the operator Ad A is not 
diagonalizable. We will explain its Jordan form. Let us explain visually how Ad^ 
acts: given any matrix, decompose it into blocks CVj as in the following picture 



/ C22 



C 



:!2 



C42 



V c d2 



23 



G 



2 1 



C44 



C 2d \ 



CddJ 



where the block C22 is a 2 x 2 matrix, the blocks C 2 j are 2x1, the blocks Ci 2 are 
1x2 and the others are lxl. Then, the operator Ad^ leaves invariant the space 
Tij of matrices whose nonzero coefficients lie inside the block Cy. Moreover, it is 
easily computed that the operator Ad^ has the following properties: 

• restricting to the space 1^2, which we canonically identify to g[(2,C), one 
has: 



Ad A 
Ad A 





1 

1 





1 -1 
1 -1 

1 -1 




Ad A 
Ad A 





1 

1 





1 

1 

1 





One then easily computes that, in the ordered basis formed by vectors 



J^ 







■h 





1 



1 
1 



the matrix of Ad^lTn is 



1 1 


\ 


1 1 




1 






1 / 



V 

• For any j ^ 3, identifying T 2 j to the space of 2 x 1 matrices, the matrix of 



Ad 4 |r 



2; 



A2AT 1 





and E 2 j 



x 2 x: 



in the basis formed by matrices A2A E\ 



Ad A \T l2 is 
,-i 



Aj A2 



(i , , , 1 , , where we use the notation Eij from (|2.4D . 

For any % ^ 3, identifying r^2 to the space of 1 x 2 matrices, the matrix of 
\—\ ) in the basis formed by matrices — A^A^ 1 ^ 1 



A,A 2 
(0 — AjA^ 1 ) and E l2 = (l 0). 
• for 3 i,j < d, (Eij) is a basis of T^; it is an eigenvector with eigenvalue 



A, A: 
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• The spaces Tij, for 2 i,j d have respective spectra {A^A^ 1 }, which for 
i ¥= j are pairwise disjoint and different from {1}. 
The concatenation of the bases described above gives a Jordan basis for Ad a- 
Now take a matrix B that is a good match for A, and consider its expression as a 
linear combination of the elements of that Jordan basis. 

Claim A. 4. All coefficients in this linear combination are nonzero, except possibly 
the coefficients of the vectors J\, Ji, J4 and the vectors En, for all 3 ^ i < d. 

The verification is direct. 

Consider now the splitting Matdx<i(C) = V© A, where A is the subspace CJ4© 
E33 © ... © Edd of the space of diagonal matrices, and V is the space spanned by 
all other elements of the above Jordan basis. Note that 



v = (CJi + c j 2 + cj 3 ) © r 

is a decomposition of V into Ad^- invariant subspaces with pairwise disjoint spectra 



Let 7r be the projection onto V along A. It follows from the claim and Lemmas 13.1 



and |3.2| that n(B) is a cyclic vector for Ad^y. So, using the AdA-invariance of 
the spaces V and A, we have 

7r(r) = 7r(sorbAd A (B)) = sorb Ad ^ (tt(B)) = V. 

Note that V contains the matrices Eij, for all i ^ j, hence {Id} v V is a generalized 
Toeplitz space. As n projects along a subspace of diagonal matrices, {Id} v T is 
again a generalized Toeplitz space and in particular is a transitive space. 



We have considered the four types, and Proposition IA.2I is proved. □ 



Appendix B. Complementary facts about singular constant inputs of 

GENERIC TYPE 



In this appendix we give grounds for Remark ll.3l . We also discuss other control 



theoretic properties of generic semilinear systems, related to universal regularity. 

B.l. Local persistence o f sin gular inputs. Let A e C r (U,GL(d,M.)), r ^ 1. 
We will work upon Lemma |2.9| in order to obtain a more practical way to detect 
that the 1-je t of A at a point corresponds to conspicuously poor data. (Recall from 



Remark 16.71 that this is the only type of poor data that appears generically.) For 
example, in the m = 1, d = 2 case, we will see that conspicuous poorness m eans 
that the angular velocity of one of the eigendirections vanishes (see Remark Il3.ll 
below). 

Suppose that uq e hi is such that the matri x A(u q) is diagonalizable over R and 
with simple eigenvalues only. By Proposition 2.1dl there is a neighborhood Uq of 



uq and C r -maps Ai, . . . , Ad : Uq — * C such that for all u e Uq, the complex numbers 
Xi(u) are all distinct, and form the spectrum of A(u); moreover there exist a C r 
map P : Uq —> GL(g?, R) such that for all u e Uq, 

(B.l) A{u) = P(u) A(u) P-\u) , where A(u) = Diag(Ai(u), . . . , A d (w)). 

For simplicity, let us consider first case where U is an interval in R (in par- 
ticular m = 1). Then the normalized derivative of A at a point u can be iden- 
tified with N(u) := A'(u) A _1 (u). Consider the expression of N(u) in the basis 
that diagonalizes A(u), that is, B{u) := P' 1 ^) N(u) P(u). Since ^P _1 (tt) = 
— P _1 (m) P'(u) P~ 1 (w), we compute that 

B(u) = A'(u) A- l (u) + Q(u) - A(u) Q(u) A^u) , 
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where 

Q(u) :=P-\u) P'(u). 
So the off-diagonal entries of the matrices B(u) and Q(u) are related by 

bij{u) = (l - Xi(u)/Xj(u)) qij(u) (i ^ j). 
In view of Lemma l2.9l we conclude the following: if for some e Uq 
(B.2) there is an off-diagonal entry position such that qij(u*) = 
then the 1-jet ^Afa*) is poor. 

Remark B.l. Let us give a geometrical interpretation of condition (|B,2[I . Lhe columns 
of P form a basis (vi, . . . ,Vd) of eigenvectors of A, and the rows of P _1 form a basis 
(/i, . . . , fd) of eigenfunctionals of A (in the sense that fioA = A;/i); these two bases 

are related by fi(vj) = Stj. So qij = fi f-^f-J is the component of the velocity of Vj 
in the direction of Vi. For example, for d = 2, condition (|B.2|) means that one of the 
eigendirections of A has zero angular speed at instant u = u*. 

It is trivial to adapt the previous calculations to the higher dimensional case and 
then conclude the following: 

Proposition B.2. Let (ui, . . . , u m ) be coordinates in a chart domainUo c U where 
expression (|B. 1[) holds. Consider matrices 



c)P 

(B.3) Q k {u):=P-\u)?—(u). 

If for some u* e Uq there is an off-diagonal entry position (i,j) such that 

(B.4) for each k = l,...,m, the (i, j)-entry of the matrix Qk(u*) vanishes 

then the l-jet j 1 A(u%) is poor, that is, the constant input (u*, . . . ,it*) (of any 
length) is singular. 



In the situation of Proposition IB.2L assume additionally that the map 

(B.5) $: < is a diffeomorphism. 

[u ^ [the (z,j)-entry of Qk{u)] 1<k<m 

In that case, the existence of a poor jet is persist ent i n the following way: If 
A is sufficiently C 2 -closc to A then by Proposition 2. ldl we can express A(u) = 
P(u) A(u) P _1 (u) for u close to u*, where P and A are C 2 -close to P and A 
respectively, and A is diagonal. The corresponding matrices Qk = P~ x are 
C7 1 -close to Qk and the map 



[the (z, j>entry of Qfc(tt)j 



$: u 



is C 1 -close to $. By (IB. 51) the fact that = (0, ...,0), there is u close to 

such that $(u) = (0, ...,0). In particular the 1-jet j 1 A(u) is poor. 

Now, concerning existence: It is evident that a domain Uq and 2-jets j 2 P(u*) 
satisfying conditions (|B.4|) and (IB.5|) actually exist; moreover we can always find 
a map P:U — > GL(d, R) with a prescribed 2-jet at a point u*. In view of the 
discussion above, we conclude the following: 

Proposition B.3 (Persistence of singular inputs). For any d ^ 1 and any d- 

dimensional smooth manifold U, there exists a C 2 -open nonempty subset of maps 
A e C 2 (U, GL(d, R)) such that the following holds: 

there exists u e U such that the constant inputs (u, . . . , u) of any length are all 
singular for the system (|1.4p . 
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That is, one cannot improve Theorem ll.ll replacing "discrete set" by "empty 
set". 

We can also see why the statement of Theorem ll.il with "C 2 -open" replaced by 



"C 1 -open" is not true: Given any map A such that (|B.4|) holds at some point, we 
can C ll -perturb A (by C°-perturbing P) in a way such that (IB.4[) now holds for a 
non-discrete set of pointsi 18 ! 

B.2. Other control-theoretic properties. We now introduce a few control- 
theoretic notions related to acce ssibi lity and regularity, and discuss the validity 
of statements similar to Theorem ll.il for these notions. 

Consider a general control system (|1.1|) . Fix a time length N, and let 4>n 
denote the response map as in (|1.2|) . We say that a trajectory determined by 
(x ;u ,...,u N -i) is: 

• locally accessibl^^ if for every neighborhood V of (uq, . . . , u/v— 1) in U N , the 
set ^>at({xo} x V) has nonempty interior. 

• strongly locally accessible if for every neighborhood V of (uq, ■ ■ ■ ,Un-i) in 
IA N , the set 4>n({xq} x V) contains in its interior the final state <^jv(xo; uq, ■ ■ ■ , un- 

The following implications are immediate: 

regular => strongly locally accessible => locally accessible. 

We say that an input (uq, . . . , ujv-i) is universally locally accessible (resp. univer- 
sally strongly locally accessible) if the trajectory determined by (xq; uq, . . . , itjv-i) 
is locally accessible (resp. strongly locally accessible). 

Now we come back to the context of p rojec tive semilinear control systems (|1.4I) . 
A (relatively weak) corollary of Theorem ll.il is that for generic maps A, universal 
local accessibility holds at all constant inputs: 

Proposition B.4. Let N e N and O <= C 2 (U, GL(d,R)) be as in Theorem [71 
For any A e O, every constant input sequence of length N is universally locally 
accessible. 

Proof. If A e O then for every constant input sequence of length N we can find a 
regular input sequence nearby. □ 



As we have shown in Proposition lB.3l it is not possible to improve Proposi- 



tion lB.4l by replacing "local accessibility" by "regularity" . Neither it is possible to 
replace "local accessibility" by "strong local accessibility" , as the following simple 
example (in m = 1, d = 2) shows: 

Example B.5. For ueR, define 

PW= (u 2 l)> A(n) = Diag(2,l). 

Let U be an small open interval containing 0, and define A: U — » GL(2,R) by (|B.1[) . Let 
£o G MP correspond to the direction of the vector (1, 0). Then for any subinterval V 3 0, 
and any N > 0, the set 

<M{£o} x V N ) = {^(«n-i) ■ ■ ■ A(«o) V) 

is an "interval" of RP 1 containing £o = 0jv(£o; 0, . . . , 0) in its boundary. Therefore the 
input (0, . . . , 0) is not universally strongly locally accessible. A similar situation occurs 
for any C 2 -perturbation of A. 



18 Using this idea and Baire's theorem, one can also show that the conclusion of Theorem ll.il 
is not true for C 1 -generic maps A\ actually for C 1 -generic A, the points u e U corresponding to 
singular constant controls form a perfect set. 

^Beware: a different concept with this name appears in .CK2 . 
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Appendix C. Proof of a complex version of Theorem I l.ll 

For the complex case we consider instead holomorphic mappings A : U — > 
GL(d,C). 

More precisely, given an open subset U c C m , we denote by GL(d, C)) the 
set of holomorphic mappings A:U^> GL(d, C) endowed with the usual topology 
of uniform convergence on compact sets. 

Theorem C.l. Given integers d S= 2 and m 5= 1, there exists an integer N ^ 1 
with the following properties. Let IA cz C m be an open subset. Then for any compact 
set K cz hi, there exists an open and dense subset O ofTL(U, GL(d, C)) such that for 
any Ae O the constant inputs in K N are all universally regular for the system (| 1 .4[) , 
except for a finite subset. 

We have the straightforward corollary: 

Corollary C.2. Given integers d 5= 2 and m ^ I, there exists an integer N > 1 
with the following properties. Let U cz C" 1 be an open subset. There exists a residual 
subset 1Z of TL(U, GL(<i, C)) such that for any Ae 1Z the constant inputs in U are 
all universally regular for the system (11.41) , except for a discrete subset. 

These results could probably be obtained in certain more general complex man- 
ifolds. But in order to avoid technicalities, we consider only open subsets of C m . 
Also, we use only elementary real transversality tools. 

Proof of Theorem Let Li cz C m be an open subset. We may identify the set of 
1-jets from U to GL(d, C) with 

U x GL{d,C) x fl l(d,C) m . 

As we did in Section ml and using Theorem instead of Theorem ll.8l . we obtain 
that the set of poor 1-jets from U to GL(d, C) is the algebraic subset U x Vm of 
the space of 1-jets. Hence it admits a stratification 

U x V$ =MxS„d...dWxE . 

Write U x Vm as the disjoint union [_l 0<i<n Xj where each X, is a smooth sub- 
manifold of dimension i in the jet space J 1 (U, GL(d, C)), and X n has codimension 
m. 

Fix now a map A e H(U,GL(d, C)). For all v = (a,6i,...6 m ) e C m+1 and 
u = (ui, * ■ * , u rn ) e C'", write 

m 

P v (u) = a + 2] huk- 

i=l 

d 2 

For all v = (vi t j)x^i,j^d e (C m+1 ) , write P v = [Pv id ] 1<i j <d and define the map 
$ v = A + P v . One can write the 1-jet extension j 1 A at the point u e hi as 

j x A{u) = [u, A(u),B u . . . , B m ] eUx GL(d, C) x [Mat dxd (C)] m . 

The same way, if we put Vij = (a^, 6i,ij, . . . , we have 

j 1 P v (u) = [u,P v (u), (6i,i,j)l«,j«(i) • • • j ( b m,i,j)l*Zi,j*Zd] ■ 
Define the map F : v h- » F v = j 1 ^. The evaluation map of F is: 

F cv. |(C m+1 ) d2 x W - U x Mat dxd (C) x [Mat dxd (C)] m 
)(v,it) i-» F v (u) 
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Hence, 

F cv (v,u)=j\A + P v ) 

= u,(A + P,) (u), (bi,i,j) 1<iJ<d , • . • , (b m ,i,j) u „| 

Claim C.3. For all u, the map F cv restricts to a submersion from the (-,u)-fiber 
to the [u. ■'[-fiber. 

Proof. We want to prove that 

v h> [(A + P v )(u), (h,^)^^ (&m,i,j) ls;ijs j 

is a submersion, or equivalently that 

v h- [P v (u), (h,i,j) 1<iJ<d , • ■ • , {b m ,i,j) 1<iJ<d \ 

is a submersion. Noting that v = (a^j, bk,i,j) , this comes easily from the fact 

that (a,i t j) i— > P v (u) is a submersion, for any fixed set of coefficients (bk,i,j) mj^d . 



That claim immediately implies that F cv is a submersion. In particular it is 
transverse to each Xi. By the parametric transversality theorem (see [Hi I p. 79]), 

d 2 

there is a residual subset of parameters v in (C" l+1 ) such that F v = j 1 ^ is 
transverse to Xi, for all i. 

When v goes to 0, $ v tends to A in H {Li, GL(d, C)). Hence, the denseness in 
H (U, GL(d, C)) of the maps A such that j 1 A is transverse to Xi, for all i. Take 
such a map A: for all i, the image of j x A does not intersect Xq u • • • u X n -\ and 
intersects X n (which has codimension m) only in a discrete subset. 

Fix K' cz U a compact set that contains K in its interior. The image j 1 A 
restricted to K' can only intersect X n in a finite set T: indeed, any accumulation 
point of that intersection set would have to be in Xq u • • • u X n —\, since Xq u . . . u X n 
is closed, and this would contradict that j x A does not intersect JTo u • • • u X n -\. 

By the choice of our topology, a small perturbation A of A is C° close to A by 
restriction to K' . By Cauchy's formula, the map A is C 2 close to A over the set K . 
Hence, the (compact) image of j 1 A restricted to K is still far from Xq u • • • u X n _i, 
and intersects X n transversally in some e- neighborhood of T inside X n . Thus it 
also has to intersect X n in only a finite set. 

So we found an open and dense subset of holomorphic maps whose 1-jets above 
K intersect the set of iV-poor jets in only on a finite number of points. As a 
consequence, for such maps, there are only a finite number constant singular inputs 
in K N for the system 11.41 This concludes the proof of Theorem IC.ll □ 
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